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COMPOSITIONS AND METHODS FOR THE THERAPY 
AND DIAGNOSIS OF BREAST CANCER 

BACKGROUND OF THE INVENTION 

5 Field of the Invention 

The present invention relates generally to therapy and diagnosis 
of cancer, such as breast cancer. The invention is more specifically related to 
polypeptides, comprising at least a portion of a breast tumor protein, and to 
polynucleotides encoding such polypeptides. Such polypeptides and 
10 polynucleotides are useful in pharmaceutical compositions, e.g., vaccines, and 
other compositions for the diagnosis and treatment of breast cancer. 

Description of the Related Art 

Cancer is a significant health problem throughout the world. 
Although advances have been made in detection and therapy of cancer, no 

1 5 vaccine or other universally successful method for prevention and/or treatment 
is currently available. Current therapies, which are generally based on a 
combination of chemotherapy or surgery and radiation, continue to prove 
inadequate in many patients. 

Breast cancer is a significant health problem for women in the 

20 United States and throughout the world. Although advances have been made 
in detection and treatment of the disease, breast cancer remains the second 
leading cause of cancer-related deaths in women, affecting more than 180,000 
women in the United States each year. For women in North America, the life- 
time odds of getting breast cancer are now one in eight. 

25 No vaccine or other universally successful method for the 

prevention or treatment of breast cancer is currently available. Management of 
the disease currently relies on a combination of early diagnosis (through routine 
breast screening procedures) and aggressive treatment, which may include one 
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or more of a variety of treatments such as surgery, radiotherapy, chemotherapy 
and hormone therapy. The course of treatment for a particular breast cancer is 
often selected based on a variety of prognostic parameters, including an 
analysis of specific tumor markers. See, e.g., Porter-Jordan and Lippman, 
5 Breast Cancer 8:73-1 00 (1 994). However, the use of established markers 
often leads to a result that is difficult to interpret, and the high mortality 
observed in breast cancer patients indicates that improvements are needed in 
the treatment, diagnosis and prevention of the disease. 

In spite of considerable research into therapies for these and 
1 o other cancers, breast cancer remains difficult to diagnose and treat effectively. 
Accordingly, there is a need in the art for improved methods for detecting and 
treating such cancers. The present invention fulfills these needs and further 
provides other related advantages. 
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SUMMARY OF THE INVENTION 

In one aspect, the present invention provides polynucleotide 
compositions comprising a sequence selected from the group consisting of: 

(a) sequences provided in SEQ ID NO:1, 3-86, 142-298, 301- 
303, 307, 313, 314, 316, 317, 323, 325, 327-330, 335, 339, and 341-344; 

(b) complements of the sequences provided in SEQ ID NO:1 , 
20 3-86, 142-298, 301-303, 307, 313, 314, 316, 317, 323, 325, 327^30, 335, 339, 

and 341 -344; 

(c) sequences consisting of at least 20, 25, 30, 35, 40, 45, 50, 
75 and 1 00 contiguous residues of a sequence provided in SEQ ID NO:1 , 3-86, 
142-298, 301-303, 307, 313, 314, 316, 317, 323, 325, 327-330, 335, 339, and 

25 341-344; 

(d) sequences that hybridize to a sequence provided in SEQ 
ID NO:1 , 3*6, 142-298, 301-303, 307, 313, 314, 316, 317, 323, 325, 327-330, 
335, 339, and 341 -344, under moderate or highly stringent conditions; 
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(e) sequences having at least 75%, 80%, 85%, 90%, 95%, 
96%, 97%, 98% or 99% identity to a sequence of SEQ ID NO:1 , 3-86, 142-298, 
301-303, 307, 313, 314, 316, 317, 323, 325, 327-330, 335, 339, and 341-344; 

(f) degenerate variants of a sequence provided in SEQ ID 
5 NO:1 , 3-86, 142-298, 301-303, 307, 313, 314, 316, 317, 323, 325, 327-330, 

335, 339, and 341-344. 

In one preferred embodiment, the polynucleotide compositions of 
the invention are expressed in at least about 20%, more preferably in at least 
about 30%, and most preferably in at least about 50% of breast tumors 
1 0 samples tested, at a level that is at least about 2-fold, preferably at least about 
5-fold, and most preferably at least about 1 0-fold higher than that for normal 
tissues. 

The present invention, in another aspect, provides polypeptide 
compositions comprising an amino acid sequence that is encoded by a 

1 5 polynucleotide sequence described above. 

The present invention further provides polypeptide compositions 
comprising an amino acid sequence selected from the group consisting of 
sequences recited in SEQ ID NO:131-140, 299, 300, 304-306, 308-312, 315, 
318, 324, 326, 331 -334, 336, 340, and 345-428. 

20 In certain preferred embodiments, the polypeptides and/or 

polynucleotides of the present invention are immunogenic, /.e., they are 
capable of eliciting an immune response, particularly a humoral and/or cellular 
immune response, as further described herein. 

The present invention further provides fragments, variants and/or 

25 derivatives of the disclosed polypeptide and/or polynucleotide sequences, 
wherein the fragments, variants and/or derivatives preferably have a level of 
immunogenic activity of at least about 50%, preferably at least about 70% and 
more preferably at least about 90% of the level of immunogenic activity of a 
polypeptide sequence set forth in SEQ ID NO:131-140, 299, 300, 304-306, 

30 308-31 2,315, 31 8, 324, 326, 331 -334, 336, 340, and 345-428 or a polypeptide 
sequence encoded by a polynucleotide sequence set forth in SEQ ID NO:1 , 3- 
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86, 142-298, 301-303, 307, 313, 314, 316, 317, 323, 325, 327-330, 335, 339, 
and 341-344. 

The present invention further provides polynucleotides that 
encode a polypeptide described above, expression vectors comprising such 
5 polynucleotides and host cells transformed or transfected with such expression 
vectors. 

Within other aspects, the present invention provides 
pharmaceutical compositions comprising a polypeptide or polynucleotide as 
described above and a physiologically acceptable carrier. 
1 o Within a related aspect of the present invention, the 

pharmaceutical compositions, e.g., vaccine compositions, are provided for 
prophylactic or therapeutic applications. Such compositions generally comprise 
an immunogenic polypeptide or polynucleotide of the invention and an 
immunostimulant, such as an adjuvant. 
! 5 The present invention further provides pharmaceutical 

compositions that comprise: (a) an antibody or antigen-binding fragment 
thereof that specifically binds to a polypeptide of the present invention, or a 
fragment thereof; and (b) a physiologically acceptable carrier. 

Within further aspects, the present invention provides 
20 pharmaceutical compositions comprising: (a) an antigen presenting cell that 
expresses a polypeptide as described above and (b) a pharmaceutically 
acceptable carrier or excipient. Illustrative antigen presenting cells include 
dendritic cells, macrophages, monocytes, fibroblasts and B cells. 

Within related aspects, pharmaceutical compositions are provided 
25 that comprise: (a) an antigen presenting cell that expresses a polypeptide as 
described above and (b) an immunostimulant. 

The present invention further provides, in other aspects, fusion 
proteins that comprise at least one polypeptide as described above, as well as 
polynucleotides encoding such fusion proteins, typically in the form of 
30 pharmaceutical compositions, e.g., vaccine compositions, comprising a 
physiologically acceptable carrier and/or an immunostimulant. The fusions 
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proteins may comprise multiple immunogenic polypeptides or portions/variants 
thereof, as described herein, and may further comprise one or more 
polypeptide segments for facilitating the expression, purification and/or 
immunogenic'rty of the polypeptide®. 
5 Within further aspects, the present invention provides methods for 

stimulating an immune response in a patient, preferably a T cell response in a 
human patient, comprising administering a pharmaceutical composition 
described herein. The patient may be afflicted with breast cancer, in which 
case the methods provide treatment for the disease, or patient considered at 

1 0 risk for such a disease may be treated prophylactically. 

Within further aspects, the present invention provides methods for 
inhibiting the development of a cancer in a patient, comprising administering to 
a patient a pharmaceutical composition as recited above. The patient may be 
afflicted with breast cancer, in which case the methods provide treatment for 

1 5 the disease, or patient considered at risk for such a disease may be treated 
prophylactically. 

The present invention further provides, within other aspects, 
methods for removing tumor cells from a biological sample, comprising 
contacting a biological sample with T cells that specifically react with a 

20 polypeptide of the present invention, wherein the step of contacting is 

performed under conditions and for a time sufficient to permit the removal of 
cells expressing the protein from the sample. 

Within related aspects, methods are provided for inhibiting the 
development of a cancer in a patient, comprising administering to a patient a 

25 biological sample treated as described above. 

Methods are further provided, within other aspects, for stimulating 
and/or expanding T cells specific for a polypeptide of the present invention, 
comprising contacting T cells with one or more of: (i) a polypeptide as 
described above; (ii) a polynucleotide encoding such a polypeptide; and/or (iii) 

30 an antigen presenting cell that expresses such a polypeptide; under conditions 
and for a time sufficient to permit the stimulation and/or expansion of T cells. 



5 



- • a nt . t c^,ini21%2E419C12/Fet^^ 
WpM013431M^a!W^fi^^ 

PCT/US02/24917 

WO 03/013431 

Isolated T cell populations comprising T cells prepared as described above are 
also provided. 

Within further aspects, the present invention provides methods for 
inhibiting the development of a cancer in a patient, comprising administering to 
5 a patient an effective amount of a T cell population as described above. 

The present invention further provides methods for inhibiting the 
development of a cancer in a patient, comprising the steps of: (a) incubating 
CD4+ and/or CD8+ T cells isolated from a patient with one or more of : 0) a 
polypeptide comprising at least an immunogenic portion of polypeptide 
1 o disclosed herein; (ii) a polynucleotide encoding such a polypeptide; and (Hi) an 
antigen-presenting cell that expressed such a polypeptide; and (b) 
administering to the patient an effective amount of the proliferated T cells, and 
thereby inhibiting the development of a cancer in the patient. Proliferated cells 
may, but need not, be cloned prior to administration to the patient. 
! 5 Within further aspects, the present invention provides methods for 

determining the presence or absence of a cancer, preferably a breast cancer, 
in a patient comprising: (a) contacting a biological sample obtained from a 
patient with a binding agent that binds to a polypeptide as recited above; (b) 
detecting in the sample an amount of polypeptide that binds to the binding 
20 agent; and (c) comparing the amount of polypeptide with a predetermined cut- 
off value, and therefrom determining the presence or absence of a cancer in 
the patient. Within preferred embodiments, the binding agent is an antibody, 
more preferably a monoclonal antibody. 

The present invention also provides, within other aspects, 
25 methods for monitoring the progression of a cancer in a patient. Such methods 
comprise the steps of: (a) contacting a biological sample obtained from a 
patient at a first point in time with a binding agent that binds to a polypeptide as 
recited above; (b) detecting in the sample an amount of polypeptide that binds 
to the binding agent; (c) repeating steps (a) and (b) using a biological sample 
30 obtained from the patient at a subsequent point in time; and (d) comparing the 
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amount of polypeptide detected in step (c) with the amount detected in step (b) 
and therefrom monitoring the progression of the cancer in the patient. 

The present invention further provides, within other aspects, 
methods for determining the presence or absence of a cancer in a patient, 
5 comprising the steps of: (a) contacting a biological sample, e.g., tumor sample, 
serum sample, etc., obtained from a patient with an oligonucleotide that 
hybridizes to a polynucleotide that encodes a polypeptide of the present 
invention; (b) detecting in the sample a level of a polynucleotide, preferably 
mRNA, that hybridizes to the oligonucleotide; and (c) comparing the level of 

1 0 polynucleotide that hybridizes to the oligonucleotide with a predetermined cut- 
off value, and therefrom determining the presence or absence of a cancer in 
the patient. Within certain embodiments, the amount of mRNA is detected via 
polymerase chain reaction using, for example, at least one oligonucleotide 
primer that hybridizes to a polynucleotide encoding a polypeptide as recited 

15 above, or a complement of such a polynucleotide. Within other embodiments, „ 
the amount of mRNA is detected using a hybridization technique, employing an 
oligonucleotide probe that hybridizes to a polynucleotide that encodes a 
polypeptide as recited above, or a complement of such a polynucleotide. 

In related aspects, methods are provided for monitoring the 

20 progression of a cancer in a patient, comprising the steps of: (a) contacting a 
biological sample obtained from a patient with an oligonucleotide that 
hybridizes to a polynucleotide that encodes a polypeptide of the present 
invention; (b) detecting in the sample an amount of a polynucleotide that 
hybridizes to the oligonucleotide; (c) repeating steps (a) and (b) using a 

25 biological sample obtained from the patient at a subsequent point in time; and 
(d) comparing the amount of polynucleotide detected in step (c) with the 
amount detected in step (b) and therefrom monitoring the progression of the 
cancer in the patient. 

Within further aspects, the present invention provides antibodies, 

30 such as monoclonal antibodies, that bind to a polypeptide as described above, 
as well as diagnostic kits comprising such antibodies. Diagnostic kits 
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comprising one or more oligonucleotide probes or primers as described above 

are also provided. 

These and other aspects of the present invention will become 
apparent upon reference to the following detailed description and attached 
5 drawings. All references disclosed herein are hereby incorporated by reference 
in their entirety as if each was incorporated individually. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 shows the differential display PCR products, separated 
by gel electrophoresis, obtained from cDNA prepared from normal breast tissue 
10 (lanes 1 and 2) and from cDNA prepared from breast tumor tissue from the 

same patient (lanes 3 and 4). The arrow indicates the band corresponding to 

B18Ag1. 

Figure 2 is a northern blot comparing the level of B18Ag1 mRNA 
in breast tumor tissue (lane 1) with the level in normal breast tissue. 
15 Figure 3 shows the level of B18Ag1 mRNA in breast tumor tissue 

compared to that in various normal and non-breast tumor tissues as 
determined by RNase protection assays. 

Figure 4 is a genomic clone map showing the location of 
additional retroviral sequences obtained from ends of Xbal restriction digests 
20 (provided in SEQ ID NO:3 - SEQ ID NO:1 0) relative to B1 8Ag1 . 

Figures 5A and 5B show the sequencing strategy, genomic 
organization and predicted open reading frame for the retroviral element 

containing B18Ag1. 

Figure 6 shows the nucleotide sequence of the representative 

25 breast tumor-specific cDNA B1 8Ag1 . 

Figure 7 shows the nucleotide sequence of the representative 

breast tumor-specif ic cDNA B17Ag1 . 

Figure 8 shows the nucleotide sequence of the representative 

breast tumor-specific cDNA B17Ag2. 
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Figure 9 shows the nucleotide sequence of the representative 
breast tumor-specific cDNA B1 3Ag2a. 

Figure 1 0 shows the nucleotide sequence of the representative 
breast tumor-specific cDNA B13Ag1b. 
5 Figure 1 1 shows the nucleotide sequence of the representative 

breast tumor-specific cDNA B13Ag1a. 

Figure 1 2 shows the nucleotide sequence of the representative 
breast tumor-specific cDNA B1 1 Ag1 . 

Figure 13 shows the nucleotide sequence of the representative 
1 0 breast tumor-specific cDNA B3CA3c. 

Figure 14 shows the nucleotide sequence of the representative 
breast tumor-specific cDNA B9CG1 . 

Figure 1 5 shows the nucleotide sequence of the representative 
breast tumor-specific cDNA B9CG3. 
1 5 Figure 1 6 shows the nucleotide sequence of the representative 

breast tumor-specific cDNA B2CA2. 

Figure 17 shows the nucleotide sequence of the representative 
breast tumor-specific cDNA B3CA1 . 

Figure 1 8 shows the nucleotide sequence of the representative 
20 breast tumor-specific cDNA B3CA2. 

Figure 19 shows the nucleotide sequence of the representative 
breast tumor-specific cDNA B3CA3. 

Figure 20 shows the nucleotide sequence of the representative 
breast tumor-specific cDNA B4CA1 . 
25 Figure 21 A depicts RT-PCR analysis of breast tumor genes in 

breast tumor tissues (lanes 1-8) and normal breast tissues (lanes 9-13) and 
HaO (lane 14). 

Figure 21 B depicts RT-PCR analysis of breast tumor genes in 
prostate tumors (lane 1 , 2), colon tumors (lane 3), lung tumor (lane 4), normal 
30 prostate (lane 5), normal colon (lane 6), normal kidney (lane 7), normal liver 
(lane 8), normal lung (lane 9), normal ovary (lanes 10, 18), normal pancreases 



9 



, ,« , ♦ ^,im21%2E419C1^etch/WO0301^ 
WO0301343JJM£^^ 

PCTAJS02/24917 

WO 03/013431 

(lanes 11, 12), normal skeletal muscle (lane 13), normal skin (lane 14), normal 
stomach (lane 15), normal testes (lane 16), normal small intestine (lane 17), 
HBL-100 (lane 19), MCF-12A (lane 20), breast tumors (lanes 21-23), H 2 0 (lane 

24), and colon tumor (lane 25). 
5 Figure 22 shows the recognition of a B1 1 Ag1 peptide (referred to 

as B1 1-8) by an anti-B1 1-8 CTL line. 

Rgure 23 shows the recognition of a cell line transduced with the 

antigen B1 1 Ag1 by the B1 1 -8 specific clone A1 . 

Figure 24 shows recognition of a lung adenocarcinoma line (LT- 
10 140-22) and abreast adenocarcinoma line (CAMA-1) bythe B11-8specific 
clone A1 . 

DETAILED DESCRIPTION OF THE INVENTION 

U.S. patents, U.S. patent application publications, U.S. patent 
applications, foreign patents, foreign patent applications and non-patent 
15 publications referred to in this specification and/or listed in the Applteation Data 
Sheet, are incorporated herein by reference, in their entirety. 

The present invention is directed generally to compositions and 
their use in the therapy and diagnosis of cancer, particularly breast cancer. As 
described further below, illustrative compositions of the present invention 
20 include, but are not restricted to, polypeptides, particularly immunogenic 
polypeptides, polynucleotides encoding such polypeptides, antibodies and 
other binding agents, antigen presenting cells (APCs) and immune system cells 
(e.g., T cells). 

The practice of the present invention will employ, unless indicated 
25 specifically to the contrary, conventional methods of virology, immunology, 
microbiology, molecular biology and recombinant DNA techniques within the 
skill of the art, many of which are described below for the purpose of 
illustration. Such techniques are explained fully in the literature. See, e.g., 
Sambrook, et al., Molecular Cloning: A Laboratory Manual (2nd Edition, 1989); 
30 Maniatis et al., Molecular Cloning: A Laboratory Manual (1982); DNA Cloning: 
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A Practical Approach, vol. I & II (D. Glover, ed.); Oligonucleotide Synthesis (N. 
Gait, ed., 1984); Nucleic Acid Hybridization (B. Hames&S. Higgins, eds., 
1985); Transcription and Translation (B. Hames & S. Higgins, eds., 1984); 
Animal Cell Culture (R. Freshney, ed., 1986); Perbal, A Practical Guide to 
5 Molecular Cloning (1 984). 

All publications, patents and patent applications cited herein, 
whether supra or infra, are hereby incorporated by reference in their entirety. 

As used in this specification and the appended claims, the 
singular forms "a," "an" and "the" include plural references unless the content 
1 0 clearly dictates otherwise. 

Polypeptide Compositions 

As used herein, the term "polypeptide" is used in its conventional 
meaning, i.e., as a sequence of amino acids. The polypeptides are not limited 
to a specific length of the product; thus, peptides, oligopeptides, and proteins 
1 5 are included within the definition of polypeptide, and such terms may be used 
interchangeably herein unless specifically indicated otherwise. This term also 
does not refer to or exclude post-expression modifications of the polypeptide, 
for example, glycosylations, acetylations, phosphorylations and the like, as well 
as other modifications known in the art, both naturally occurring and non- 
20 naturally occurring. A polypeptide may be an entire protein, or a subsequence 
thereof. Particular polypeptides of interest in the context of this invention are 
amino acid subsequences comprising epitopes, i.e., antigenic determinants 
substantially responsible for the immunogenic properties of a polypeptide and 
being capable of evoking an immune response. 
25 Particularly illustrative polypeptides of the present invention 

comprise those encoded by a polynucleotide sequence set forth in any one of 
SEQ ID NO:1, 3-86, 142-298, 301-303, 307, 313, 314, 316, 317, 323, 325, 327- 
330, 335, 339, and 341-344, or a sequence that hybridizes under moderately 
stringent conditions, or, alternatively, under highly stringent conditions, to a 
30 polynucleotide sequence set forth in any one of SEQ ID NO:1 , 3-86, 1 42-298, 
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301-303, 307, 313, 314, 316, 317, 323, 325, 327-330, 335, 339, and 341-344. 
Certain other illustrative polypeptides of the invention comprise amino acid 
sequences as set forth in any one of SEQ ID NO.131-140, 299, 300, 304-306, 
308-312, 315, 318, 324, 326, 331-334, 336, 340, and 345-428. 
5 The polypeptides of the present invention are sometimes herein 

referred to as breast tumor proteins or breast tumor polypeptides, as an 
indication that their identification has been based at least in part upon their 
increased levels of expression in breast tumor samples. Thus, a "breast tumor 
polypeptide" or "breast tumor protein," refers generally to a polypeptide 
10 sequence of the present invention, or a polynucleotide sequence encoding 
such a polypeptide, that is expressed in a substantial proportion of breast tumor 
samples, for example preferably greater than about 20%, more preferably 
greater than about 30%, and most preferably greater than about 50% or more 
of breast tumor samples tested, at a level that is at least two fold, and 
15 preferably at least five fold, greater than the level of expression in normal 
tissues, as determined using a representative assay provided herein. A breast 
tumor polypeptide sequence of the invention, based upon Its increased level of 
expression in tumor cells, has particular utility both as a diagnostic marker as 
well as a therapeutic target, as further described below. 
20 In certain preferred embodiments, the polypeptides of the 

invention are immunogenic, i.e., they react detectably within an immunoassay 
(such as an EUSA or T-cell stimulation assay) with antisera and/or T-cells from 
a patient with breast cancer. Screening for immunogenic activity can be 
performed using techniques well known to the skilled artisan. For example, 
25 such screens can be performed using methods such as those described in 
Harlow and Lane, Antibodies: A Laboratory Manual, Cold Spring Harbor 
Laboratory, 1988. In one illustrative example, a polypeptide may be 
immobilized on a solid support and contacted with patient sera to allow binding 
of antibodies within the sera to the immobilized polypeptide. Unbound sera_ 
30 may then be removed and bound antibodies detected using, for example, 125 l- 
labeled Protein A. 



12 



WO0 301343 1 f h tt p:/Aiww.getthepatent.co^^ 



WO 03/013431 PCT/US02/24917 

As would be recognized by the skilled artisan, immunogenic 
portions of the polypeptides disclosed herein are also encompassed by the 
present invention. An "immunogenic portion," as used herein, is a fragment of 
an immunogenic polypeptide of the invention that itself is immunologically 

5 reactive (i.e., specifically binds) with the B-cells and/or T-cell surface antigen 
receptors that recognize the polypeptide. Immunogenic portions may generally 
be identified using well known techniques, such as those summarized in Paul, 
Fundamental Immunology, 3rd ed., 243-247 (Raven Press, 1993) and 
references cited therein. Such techniques include screening polypeptides for 

1 0 the ability to react with antigen-specific antibodies, antisera and/or T-cell lines 
or clones. As used herein, antisera and antibodies are "antigen-specific" if they 
specifically bind to an antigen (i.e., they react with the protein in an ELISA or 
other immunoassay, and do not react detectably with unrelated proteins). Such 
antisera and antibodies may be prepared as described herein, and using well- 

15 known techniques. 

In one preferred embodiment, an immunogenic portion of a 
polypeptide of the present invention is a portion that reacts with antisera and/or 
T-cells at a level that is not substantially less than the reactivity of the full-length 
polypeptide (e.g., in an ELISA and/or T-cell reactivity assay). Preferably, the 

20 level of immunogenic activity of the immunogenic portion is at least about 50%, 
preferably at least about 70% and most preferably greater than about 90% of 
the immunogenicity for the full-length polypeptide. In some instances, 
preferred immunogenic portions will be identified that have a level of 
immunogenic activity greater than that of the corresponding full-length 

25 polypeptide, e.g., having greater than about 1 00% or 1 50% or more 
immunogenic activity. 

In certain other embodiments, illustrative immunogenic portions 
may include peptides in which an N-terminal leader sequence and/or 
transmembrane domain have been deleted. Other illustrative immunogenic 

30 portions will contain a small N- and/or C-terminal deletion (e.g., 1 -30 amino 
acids, preferably 5-15 amino acids), relative to the mature protein. 
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In another embodiment, a polypeptide composition of the 
invention may also comprise one or more polypeptides that are 
immunologically reactive with T cells and/or antibodies generated against a 
polypeptide of the invention, particularly a polypeptide having an amino acid 
5 sequence disclosed herein, or to an immunogenic fragment or variant thereof. 
In another embodiment of the invention, polypeptides are 
provided that comprise one or more polypeptides that are capable of eliciting T 
cells and/or antibodies that are immunologically reactive with one or more 
polypeptides described herein, or one or more polypeptides encoded by 
1 o contiguous nucleic acid sequences contained in the polynucleotide sequences 
disclosed herein, or immunogenic fragments or variants thereof, or to one or 
more nucleic acid sequences which hybridize to one or more of these 
sequences under conditions of moderate to high stringency. 

The present invention, in another aspect, provides polypeptide 
15 fragments comprising at least about 5, 10, 15, 20, 25, 50, or 100 contiguous 
amino acids, or more, including all intermediate lengths, of a polypeptide 
compositions set forth herein, such as those set forth in SEQ ID NO:131-140, 
299, 300, 304-306, 308-312, 315, 318, 324, 326, 331-334, 336, 340, and 345- 
428 or those encoded by a polynucleotide sequence set forth in a sequence of 
20 SEQ ID NO:1 , 3-86, 142-298, 301-303, 307, 313, 314, 316, 317, 323, 325, 327- 

330, 335, 339, and 341-344. 

In another aspect, the present invention provides variants of the 
P 6lypeptide compositions described herein. Polypeptide variants generally 
encompassed by the present invention will typically exhibit at least about 70%, 
25 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% or 
more identity (determined as described below), along its length, to a 
polypeptide sequences set forth herein. 

In one preferred embodiment, the polypeptide fragments and 
variants provided by the present invention are immunologically reactive with an 
30 antibody and/or T-cell that reacts with a full-length polypeptide specifically set 
forth herein. 
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In another preferred embodiment, the polypeptide fragments and 
variants provided by the present invention exhibit a level of immunogenic 
activity of at least about 50%, preferably at least about 70%, and most 
preferably at least about 90% or more of that exhibited by a full-length 
5 polypeptide sequence specifically set forth herein. 

A polypeptide "variant," as the term is used herein, is a 
polypeptide that typically differs from a polypeptide specifically disclosed herein 
in one or more substitutions, deletions, additions and/or insertions. Such 
variants may be naturally occurring or may be synthetically generated, for 

10 example, by modifying one or more of the above polypeptide sequences of the 
invention and evaluating their immunogenic activity as described herein and/or 
using any of a number of techniques well known in the art. 

For example, certain illustrative variants of the polypeptides of the 
invention include those in which one or more portions, such as an N-terminal 

1 5 leader sequence or transmembrane domain, have been removed. Other 
illustrative variants include variants in which a small portion (e.g., 1-30 amino 
acids, preferably 5-15 amino acids) has been removed from the N- and/or C- 
terminal of the mature protein. 

In many instances, a variant will contain conservative 

20 substitutions. A "conservative substitution" is one in which an amino acid is 
substituted for another amino acid that has similar properties, such that one 
skilled in the art of peptide chemistry would expect the secondary structure and 
hydropathic nature of the polypeptide to be substantially unchanged. As 
described above, modifications may be made in the structure of the 

25 polynucleotides and polypeptides of the present invention and still obtain a 
functional molecule that encodes a variant or derivative polypeptide with 
desirable characteristics, e.g., with immunogenic characteristics. When it is 
desired to alter the amino acid sequence of a polypeptide to create an 
equivalent, or even an improved, immunogenic variant or portion of a 

30 polypeptide of the invention, one skilled in the art will typically change one or 
more of the codons of the encoding DNA sequence according to Table 1 . 

15 
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For example, certain amino acids may be substituted for other 
amino acids in a protein structure without appreciable loss of interactive binding 
capacity with structures such as, for example, antigen-binding regions of 
antibodies or binding sites on substrate molecules. Since it is the interactive 

5 capacity and nature of a protein that defines that protein's biological functional 
activity, certain amino acid sequence substitutions can be made in a protein 
sequence, and, of course, its underlying DNA coding sequence, and 
nevertheless obtain a protein with like properties. It is thus contemplated that 
various changes may be made in the peptide sequences of the disclosed 

10 compositions, or corresponding DNA sequences which encode said peptides 
without appreciable loss of their biological utility or activity. 

Table 1 



Amino Acids Codons 



Alanine 


Ala 


A 


GCA 


GCC 


GCG 


GCU 


Cysteine 


Cys 


C 


UGC 


UGU 






Aspartic acid 


Asp 


D 


GAC 


GAU 






Glutamic acid 


Glu 


E 


GAA 


GAG 






Phenylalanine 


Phe 


F 


UUC 


UUU 






Glycine 


Gly 


G 


GGA 


GGC 


GGG 


GGU 


Histidine 


His 


H 


CAC 


CAU 






Isoleucine 


lie 


I 


AUA 


AUC 


AUU 




Lysine 


Lys 


K 


AAA 


AAG 






Leucine 


Leu 


L 


UUA 


UUG 


CUA 


cue 


Methionine 


Met 


M 


AUG 








Asparagine 


Asn 


N 


AAC 


AAU 






Proline 


Pro 


P 


CCA 


CCC 


CCG 


ecu 


Glutamine 


Gin 


Q 


CAA 


CAG 






Arginine 


Arg 


R 


AGA 


AGG 


CGA 


CGC 



CUG CUU 



CGC CGG CGU 
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Amino Acids Codons 



Serine 


Ser 


S 


AGC 


AGU 


UCA 


UCC 


Threonine 


Thr 


T 


ACA 


ACC 


ACG 


ACU 


Valine 


Val 


V 


GUA 


GUC 


GUG 


GUU 


Tryptophan 


Tip 


w 


UGG 








Tyrosine 


Tyr 


Y 


UAC 


UAU 







In making such changes, the hydropathic index of amino acids 
may be considered. The importance of the hydropathic amino acid index in 
conferring interactive biologic function on a protein is generally understood in 

5 the art (Kyte and Doolittle, 1982, incorporated herein by reference). It is 
accepted that the relative hydropathic character of the amino acid contributes 
to the secondary structure of the resultant protein, which in turn defines the 
interaction of the protein with other molecules, for example, enzymes, 
substrates, receptors, DNA, antibodies, antigens, and the like. Each amino 

1 0 acid has been assigned a hydropathic index on the basis of its hydrophobicity 
and charge characteristics (Kyte and Doolittle, 1982). These values are: 
isoleucine (+4.5); valine (+4.2); leucine (+3.8); phenylalanine (+2.8); 
cysteine/cystine (+2.5); methionine (+1.9); alanine (+1.8); glycine (-0.4); 
threonine (-0.7); serine (-0.8); tryptophan (-0.9); tyrosine (-1 .3); proline (-1 .6); 

1 5 histidine (-3.2); glutamate (-3.5); glutamine (-3.5); aspartate (-3.5); 
asparagine (-3.5); lysine (-3.9); and arginine (-4.5). 

It is known in the art that certain amino acids may be substituted 
by other amino acids having a similar hydropathic index or score and still result 
in a protein with similar biological activity, i.e., still obtain a biological 

20 functionally equivalent protein. In making such changes, the substitution of 
amino acids whose hydropathic indices are within ±2 is preferred, those within 
±1 are particularly preferred, and those within ±0.5 are even more particularly 
preferred. It is also understood in the art that the substitution of like amino 
acids can be made effectively on the basis of hydrophilicity. U.S. Patent 

25 4,554,1 01 (specifically incorporated herein by reference in its entirety), states 
that the greatest local average hydrophilicity of a protein, as governed by the 

17 
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hydrophilicity of its adjacent amino acids, correlates with a biological property of 
the protein. 

As detailed in U.S. Patent 4,554,1 01 , the following hydrophilicity 
values have been assigned to amino acid residues: arginine (+3.0); lysine 
5 (+3.0); aspartate (+3.0 ± 1); glutamate (+3.0 ± 1); serine (+0.3); asparagine 
(+0.2); glutamine (+0.2); glycine (0); threonine (-0.4); proline (-0.5 ± 1); alanine 
H.5); histidine (-0.5); cysteine (-1 0); methionine (-1 .3); valine (-1 .5); leucine 
(-1 .8); isoleucine (-1 .8); tyrosine (-2.3); phenylalanine (-2.5); tryptophan (- 
3.4). It is understood that an amino acid can be substituted for another having 
10 a similar hydrophilicity value and still obtain a biologically equivalent, and in 
particular, an immunologically equivalent protein. In such changes, the 
substitution of amino acids whose hydrophilicity values are within ±2 is 
preferred, those within ±1 are particularly preferred, and those within ±0.5 are 
even more particularly preferred. 
1 5 As outlined above, amino acid substitutions are generally 

therefore based on the relative similarity of the amino acid side-chain 
substituents, for example, their hydrophobicity, hydrophilicity, charge, size, and 
the like. Exemplary substitutions that take various of the foregoing 
characteristics into consideration are well known to those of skill in the art and 
20 include: arginine and lysine; glutamate and aspartate; serine and threonine; 
glutamine and asparagine; and valine, leucine and isoleucine. 

In addition, any polynucleotide may be further modified to 
increase stability in vivo. Possible modifications include, but are not limited to, 
the addition of flanking sequences at the 5' and/or 3' ends; the use of 
25 phosphorothioate or 2' O-methyl rather than phosphodiesterase linkages in the 
backbone; and/or the inclusion of nontraditional bases such as inosine, 
queosine and wybutosine, as well as acetyl- methyl-, thio- and other modified 
forms of adenine, cytidine, guanine, thymine and uridine. 

Amino acid substitutions may further be made on the basis of 
30 similarity in polarity, charge, solubility, hydrophobicity, hydrophilicity and/or the 
amphipathic nature of the residues. For example, negatively charged amino 
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acids include aspartic acid and glutamic acid; positively charged amino acids 
include lysine and arginine; and amino acids with uncharged polar head groups 
having similar hydrophilicity values include leucine, isoleucine and valine; 
glycine and alanine; asparagine and glutamine; and serine, threonine, 
5 phenylalanine and tyrosine. Other groups of amino acids that may represent 
conservative changes include: (1) ala, pro, gly, glu, asp, gin, asn, ser, thr; 
(2) cys, ser, tyr, thr; (3) val, ile, leu, met, ala, phe; (4) lys, arg, his; and (5) phe, 
tyr, trp, his. A variant may also, or alternatively, contain nonconservative 
changes. In a preferred embodiment, variant polypeptides differ from a native 

10 sequence by substitution, deletion or addition of five amino acids or fewer. 
Variants may also (or alternatively) be modified by, for example, the deletion or 
addition of amino acids that have minimal influence on the immunogenicity, 
secondary structure and hydropathic nature of the polypeptide. 

As noted above, polypeptides may comprise a signal (or leader) 

15 sequence at the N-terminal end of the protein, which co-translationally or post- 
translationally directs transfer of the protein. The polypeptide may also be 
conjugated to a linker or other sequence for ease of synthesis, purification or 
identification of the polypeptide (e.g., poly-His), or to enhance binding of the 
polypeptide to a solid support. For example, a polypeptide may be conjugated 

20 to an immunoglobulin Fc region. 

When comparing polypeptide sequences, two sequences are said 
to be "identical" if the sequence of amino acids in the two sequences is the 
same when aligned for maximum correspondence, as described below. 
Comparisons between two sequences are typically performed by comparing the 

25 sequences over a comparison window to identify and compare local regions of 
sequence similarity. A "comparison window" as used herein, refers to a 
segment of at least about 20 contiguous positions, usually 30 to about 75, 40 to 
about 50, in which a sequence may be compared to a reference sequence of 
the same number of contiguous positions after the two sequences are optimally 

30 aligned. 



19 



, e , , .-, 1 n 1 ~ 1 ■ / "r it 1 QC 1 iTf-HV""™"™ ^ r.p C ?fromCache=1 p ar^mmmM=!^^^^ 
WOW013mMEtf!^ 

PCT/US02/24917 

WO 03/013431 



Optimal alignment of sequences for comparison may be 
conducted using the Megalign program in the Lasergene suite of bioinformatics 
software (DNASTAR, Inc., Madison, Wl), using default parameters.. This 
program embodies several alignment schemes described in the following 

5 references: Dayhoff, M.O. (1978) A model of evolutionary change in proteins - 
Matrices for detecting distant relationships. In Dayhoff, M.O. (ed.) Atlas of 
Protein Sequence and Structure, National Biomedical Research Foundation, 
Washington DC Vol. 5, Suppl. 3, pp. 345-358; Hein J. (1990) Unified Approach 
to Alignment and Phylogenes pp. 626-645 Methods in Enzymologyvol 183, 

10 Academic Press, Inc., San Diego, CA; Higgins, D.G. and Sharp, P.M. (1989) 
CABIOS 5:151-153; Myers, E.W. and Muller W. (1988) CABIOS 4:1 1-17; 
Robinson, ED. (1971) Comb. Theor 11105; Saitou, N. Nei, M. (1987) Mol. 
Biol. Evol. 4:406-425; Sneath, P.H.A. and Sokal, R.R. (1973) Numerical 
Taxonomy -the Principles and Practice of Numerical Taxonomy, Freeman 

15 Press, San Francisco, CA; Wilbur, W.J. and Lipman, D.J. (1983) Pmc. Natl. 

Acad., Sci. USA 80:726-730. 

Alternatively, optimal alignment of sequences for comparison may 
be conducted by the local identity algorithm of Smith and Waterman (1981) 
Add. APL. Math 2:482, by the identity alignment algorithm of Needleman and 
20 Wunsch (1970) J. Mol. Biol. 48:443, by the search for similarity methods of 
Pearson and Lipman (1988) Proc. Natl. Acad. Sci. USA 85: 2444, by 
computerized implementations of these algorithms (GAP, BESTFIT, BLAST, 
FASTA, and TFASTA in the Wisconsin Genetics Software Package, Genetics 
Computer Group (GCG), 575 Science Dr., Madison, Wl), or by inspection. 
25 One preferred example of algorithms that are suitable for 

determining percent sequence identity and sequence similarity are the BLAST 
and BLAST 2.0 algorithms, which are described in Altschul et al. (1977) Nucl. 
Acids Res. 25:3389-3402 and Altschul et al. (1990) J. Mol. Biol. 215:403-41 0, 
respectively. BLAST and BLAST 2.0 can be used, for example with the 
30 parameters described herein, to determine percent sequence identity for the 
polynucleotides and polypeptides of the invention. Software for performing 
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BLAST analyses is publicly available through the National Center for 
Biotechnology Information. For amino acid sequences, a scoring matrix can be 
used to calculate the cumulative score. Extension of the word hits in each 
direction are halted when: the cumulative alignment score falls off by the 
5 quantity X from its maximum achieved value; the cumulative score goes to zero 
or below, due to the accumulation of one or more negative-scoring residue 
alignments; or the end of either sequence is reached. The BLAST algorithm 
parameters W, T and X determine the sensitivity and speed of the alignment. 

In one preferred approach, the "percentage of sequence identity" 

10 is determined by comparing two optimally aligned sequences over a window of 
comparison of at least 20 positions, wherein the portion of the polypeptide 
sequence in the comparison window may comprise additions or deletions (/.e., 
gaps) of 20 percent or less, usually 5 to 15 percent, or 10 to 12 percent, as 
compared to the reference sequences (which does not comprise additions or 

1 5 deletions) for optimal alignment of the two sequences. The percentage is 
calculated by determining the number of positions at which the identical amino 
acid residue occurs in both sequences to yield the number of matched 
positions, dividing the number of matched positions by the total number of 
positions in the reference sequence {i.e., the window size) and multiplying the 

20 results by 1 00 to yield the percentage of sequence identity. 

Within other illustrative embodiments, a polypeptide may be a 
xenogeneic polypeptide that comprises an polypeptide having substantial 
sequence identity, as described above, to the human polypeptide (also termed 
autologous antigen) which served as a reference polypeptide, but which 

25 xenogeneic polypeptide is derived from a different, non-human species. One 
skilled in the art will recognize that "self" antigens are often poor stimulators of 
CD8+ and CD4+ T-lymphocyte responses, and therefore efficient 
immunotherapeutic strategies directed against tumor polypeptides require the 
development of methods to overcome immune tolerance to particular self tumor 

30 polypeptides. For example, humans immunized with prostase protein from a 
xenogeneic (non human) origin are capable of mounting an immune response 
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against the counterpart human protein, e.g., the human prostase lumor protein 
present on human tumor cells. Accordingly, the present invention provides 
methods for purifying the xenogeneic form of the tumor proteins set forth 
herein, such as the polypeptides set forth in SEQ ID NO:131-140, 299, 300, 
5 304-306, 308-312, 315, 318, 324, 326, 331-334, 336, 340, and 345-428, or 
those encoded by polynucleotide sequences set forth in SEQ ID NO:1 , 3-86, 
142-298, 301-303, 307, 313, 314, 316, 317, 323, 325, 327-330, 335, 339, and 
341-344. 

Therefore, one aspect of the present invention provides 
10 xenogeneic variants of the polypeptide compositions described herein. Such 
xenogeneic variants generally encompassed by the present invention will 
typically exhibit at least about 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 
94%, 95%, 96%, 97%, 98%, or 99% or more identity along their lengths, to a 
polypeptide sequences set forth herein. 
1 5 More particularly, the invention is directed to mouse, rat, monkey, 

porcine and other non-human polypeptides which can be used as xenogeneic 
forms of human polypeptides set forth herein, to induce immune responses 
directed against tumor polypeptides of the invention. 

Within other illustrative embodiments, a polypeptide may be a 
20 fusion polypeptide that comprises multiple polypeptides as described herein, or 
that comprises at least one polypeptide as described herein and an unrelated 
sequence, such as a known tumor protein. A fusion partner may, for example, 
assist in providing T helper epitopes (an immunological fusion partner), 
preferably T helper epitopes recognized by humans, or may assist in 
25 expressing the protein (an expression enhancer) at higher yields than the 
native recombinant protein. Certain preferred fusion partners are both 
immunological and expression enhancing fusion partners. Other fusion 
partners may be selected so as to increase the solubility of the polypeptide or 
to enable the polypeptide to be targeted to desired intracellular compartments. 
30 Still further fusion partners include affinity tags, which facilitate purification of 
the polypeptide. 
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Fusion polypeptides may generally be prepared using standard 
techniques, including chemical conjugation. Preferably, a fusion polypeptide is 
expressed as a recombinant polypeptide, allowing the production of increased 
levels, relative to a non-fused polypeptide, in an expression system. Briefly, 
5 DNA sequences encoding the polypeptide components may be assembled 
separately, and ligated into an appropriate expression vector. The 3' end of the 
DNA sequence encoding one polypeptide component is ligated, with or without 
a peptide linker, to the 5' end of a DNA sequence encoding the second 
polypeptide component so that the reading frames of the sequences are in 

1 0 phase. This permits translation into a single fusion polypeptide that retains the 
biological activity of both component polypeptides. 

A peptide linker sequence may be employed to separate the first 
and second polypeptide components by a distance sufficient to ensure that 
each polypeptide folds into its secondary and tertiary structures. Such a 

15 peptide linker sequence is incorporated into the fusion polypeptide using 
standard techniques well known in the art Suitable peptide linker sequences 
may be chosen based on the following factors: (1 ) their ability to adopt a 
flexible extended conformation; (2) their inability to adopt a secondary structure 
that could interact with functional epitopes on the first and second polypeptides; 

20 and (3) the lack of hydrophobic or charged residues that might react with the 
polypeptide functional epitopes. Preferred peptide linker sequences contain 
Gly, Asn and Ser residues. Other near neutral amino acids, such as Thr and 
Ala may also be used in the linker sequence. Amino acid sequences which 
may be usefully employed as linkers include those disclosed in Maratea et al., 

25 Gene 40:39-46, 1985; Murphy et al., Proa Natl Acad Sci. USA 83:8258-8262, 
1986; U.S. Patent No. 4,935,233 and U.S. Patent No. 4,751 ,180. The linker 
sequence may generally be from 1 to about 50 amino acids in length. Linker 
sequences are not required when the first and second polypeptides have non- 
essential N-terminal amino acid regions that can be used to separate the 

30 functional domains and prevent steric interference. 
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The ligated DNA sequences are operably linked to suitable 
transcriptional or translation^ regulatory elements. The regulatory elements 
responsiblefor expression of DNA are located only 5' to the DNA sequence 
encoding the first polypeptides. Similarly, stop codons required to end 
5 translation and transcription termination signals are only present 3' to the DNA 
sequence encoding the second polypeptide. 

The fusion polypeptide can comprise a polypeptide as described 
herein together with an unrelated immunogenic protein, such as an 
immunogenic protein capable of eliciting a recall response. Examples of such 
1 o proteins include tetanus, tuberculosis and hepatitis proteins (see, for example, 
Stoute et al. New Engl. J. Med., 336:86-91 , 1997). 

In one preferred embodiment, the immunological fusion partner is 
derived from a Mycobacterium sp., such as a Mycobacterium tuberculosis- 
derived Ra12fragment. Ra12 compositions and methods for their use in 
15 enhancingthe expression and/or immunogenicity of heterologous 

polynuoleotide/pofypeptide sequences is described in U.S. Patent Application 
60/158,585, the disclosure of which is incorporated herein by reference in rts 
entirety. Briefly, Ra12 refers to a polynucleotide region that is a subsequence 
of a Mycobacterium tuberculosis MTB32A nucleic acid. MTB32A is a serine 
20 protease of 32 KD molecular weight encoded by a gene in virulent and avimlent 
strains of M. tuberculosis. The nucleotide sequence and amino acid sequence 
of MTB32A have been described (for example, U.S. Patent Application 
60/158,585; see also, Skeiky et al., Infection andlmmun. (1999) 67:3998-4007, 
incorporated herein by reference). C-terminal fragments of the MTB32A coding 
25 sequence express at high levels and remain as a soluble polypeptides 
throughout the purification process. Moreover, Ra12 may enhance the 
immunogenicity of heterologous immunogenic polypeptides with which it is 
fused. One preferred Ra12fusion polypeptide comprises a 14 KD C-terminal 
fragment corresponding to amino acid residues 192 to 323 of MTB32A. Other 
30 preferred Ra12 polynucleotides generally comprise at least about 15 
consecutive nucleotides, at least about 30 nucleotides, at least about 60 
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nucleotides, at least about 100 nucleotides, at least about 200 nucleotides, or 
at least about 300 nucleotides that encode a portion of a Ra12 polypeptide. 
Ra12 polynucleotides may comprise a native sequence (/.e., an endogenous 
sequence that encodes a Ra12 polypeptide or a portion thereof) or may 

5 comprise a variant of such a sequence. Ra1 2 polynucleotide variants may 
contain one or more substitutions, additions, deletions and/or insertions such 
that the biological activity of the encoded fusion polypeptide is not substantially 
diminished, relative to a fusion polypeptide comprising a native Ra12 
polypeptide. Variants preferably exhibit at least about 70% identity, more 

1 0 preferably at least about 80% identity and most preferably at least about 90% 
identity to a polynucleotide sequence that encodes a native Ra12 polypeptide 
or a portion thereof. 

Within other preferred embodiments, an immunological fusion 
partner is derived from protein D, a surface protein of the gram-negative 

15 bacterium Haemophilus influenza B (WO 91/18926). Preferably, a protein D 
derivative comprises approximately the first third of the protein (e.g., the first N- 
terminal 100-1 10 amino acids), and a protein D derivative may be lipidated. 
Within certain preferred embodiments, the first 109 residues of a Lipoprotein D 
fusion partner is included on the N-terminus to provide the polypeptide with 

20 additional exogenous T-cell epitopes and to increase the expression level in £ 
coli (thus functioning as an expression enhancer). The lipid tail ensures 
optimal presentation of the antigen to antigen presenting cells. Other fusion 
partners include the non-structural protein from influenzae virus, NS1 
(hemaglutinin). Typically, the N-terminal 81 amino acids are used, although 

25 different fragments that include T-helper epitopes may be used. 

In another embodiment, the immunological fusion partner is the 
protein known as LYTA, or a portion thereof (preferably a C-terminal portion). 
LYTA is derived from Streptococcus pneumoniae, which synthesizes an N- 
acetyl-L-alanine amidase known as amidase LYTA (encoded by the LytA gene; 

30 Gene 43:265-292, 1 986). LYTA is an autolysin that specifically degrades 
certain bonds in the peptidoglycan backbone. The C-terminal domain of the 
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LYTA protein is responsible for the affinity to the choline or to some choline 
analogues such as DEAE. This property has been exploited for the 
development of E oof C-LYTA expressing plasmids useful for expression of 
fusion proteins. Purification of hybrid proteins containing the C-LYTA fragment 
5 at the amino terminus has been described (see Biotechnology 10:795-798, 
• 1 992). Within a preferred embodiment, a repeat portion of LYTA may be 
incorporated into a fusion polypeptide. A repeat portion is found in the C- 
terminal region starting at residue 178. A particularly preferred repeat portion 

incorporates residues 188-305. 
10 Yetanother illustrative embodiment involves fusion polypeptides, 

and the polynucleotides encoding them, wherein the fusion partner comprises a 
targeting sfcnal capable of directing a polypeptide to the endosomatflysosomal 
compartment, as described in U.S. Patent No. 5,633,234. An immunogenic 
polypeptide of the invention, when fused with this targeting signal, will associate 
1 5 more efficiently with MHC class II molecules and thereby provide enhanced in 
vivo stimulation of CD4 + T-cells specific for the polypeptide. 

Polypeptides of the invention are prepared using any of a variety 
of well known synthetic and/or recombinant techniques, the latter of which are 
further described below. Polypeptides, portions and other variants generally 
20 less than about 1 50 amino acids can be generated by synthetic means, using 
techniques well known to those of ordinary skill in the art. In one illustrative 
example, such polypeptides are synthesized using any of the commercially 
available solid-phase techniques, such as the Merrifield solid-phase synthesis 
method, where amino acids are sequentially added to a growing amino acid 
25 chain. See Merrifield, J. Am. Chem. Soc. 85:2149-2146, 1963. Equipmentfor 
automated synthesis of polypeptides is commercially available from suppliers 
such as Perkin Elmer/Applied Biosystems Division (Foster City, CA), and may 
be operated according to the manufacturer's instructions. 

In general, polypeptide compositions (including fusion 
30 polypeptides) of the invention are isolated. An "isolated" polypeptide is one 
that is removed from its original environment. For example, a naturally- 

26 
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occurring protein or polypeptide is isolated if it is separated from some or all of 
the coexisting materials in the natural system. Preferably, such polypeptides 
are also purified, e.g., are at least about 90% pure, more preferably at least 
about 95% pure and most preferably at least about 99% pure. 

5 Polynucleotide Compositions 

The present invention, in other aspects, provides polynucleotide 
compositions. The terms "DNA" and "polynucleotide" are used essentially 
interchangeably herein to refer to a DNA molecule that has been isolated free 
of total genomic DNA of a particular species. "Isolated," as used herein, 

1 0 means that a polynucleotide is substantially away from other coding 

sequences, and that the DNA molecule does not contain large portions of 
unrelated coding DNA, such as large chromosomal fragments or other 
functional genes or polypeptide coding regions. Of course, this refers to the 
DNA molecule as originally isolated, and does not exclude genes or coding 

1 5 regions later added to the segment by the hand of man . 

As will be understood by those skilled in the art, the 
polynucleotide compositions of this invention can include genomic sequences, 
extra-genomic and plasmid-encoded sequences and smaller engineered gene 
segments that express, or may be adapted to express, proteins, polypeptides, 

20 peptides and the like. Such segments may be naturally isolated, or modified 
synthetically by the hand of man. 

As will be also recognized by the skilled artisan, polynucleotides 
of the invention may be single-stranded (coding or antisense) or double- 
stranded, and may be DNA (genomic, cDNA or synthetic) or RNA molecules. 

25 RNA molecules may include HnRNA molecules, which contain introns and 
correspond to a DNA molecule in a one-to-one manner, and mRNA molecules, 
which do not contain introns. Additional coding or non-coding sequences may, 
but need not, be present within a polynucleotide of the present invention, and a 
polynucleotide may, but need not, be linked to other molecules and/or support 

30 materials. 
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Polynucleotides may comprise a native sequence (i.e., an 
endogenous sequence that encodes a polypeptide/protein of the invention or a 
portion thereof) or may comprise a sequence that encodes a variant or 
derivative, preferably and immunogenic variant or derivative, of such a 
5 sequence. 

Therefore, according to another aspect of the present invention, 
polynucleotide compositions are provided that comprise some or all of a 
po^nucleotide sequence set forth in any one of SEQ ID NO:1 , 3-86, 142-298, 
301-303, 307, 313, 314, 316, 317, 323, 325, 327-330, 335, 339, and 341-344, 
10 complements of a polynucleotide sequence set forth in any one of SEQ ID 
NO-1 3-86, 142-298, 301-303, 307, 313, 314, 316, 317, 323, 325, 327-330, 
335 339 and 341 -344, and degenerate variants of a polynucleotide sequence 

set forth in any one of SEQ ID NO:1 , 3-86, 142-298, 301-303, 307, 313, 314, 
316, 317, 323, 325, 327-330, 335, 339, and 341-344. In certain preferred 
15 embodiments, the polynucleotide sequences set forth herein encode 
immunogenic polypeptides, as described above. 

In other related embodiments, the present invention provides 
polynucleotide variants having substantial identity to the sequences disclosed 
herein in SEQ ID NO:1. 3-86, 142-298, 301-303, 307, 313, 314, 316, 317, 323, 
20 325 327-330, 335, 339, and 341 -344, for example those comprising at least 
70% sequence identity, preferably at least 75%, 80%, 85%, 90%, 95%, 96%, 
97%, 98%, or 99% or higher, sequence identity compared to a polynucleotide 
sequence of this invention using the methods described herein, (e.g., BLAST 
anatysis using standard parameters, as described below). One skilled in this 
25 art will recognize that these values can be appropriately adjusted to determ.ne 
corresponding idenfrty of proteins encoded by two nucleotide sequences by 
taking into account codon degeneracy, amino acid similarity, reading frame 

positioning and the like. 

Typically, polynucleotide variants will contain one or more 
30 substitutions, additions, deletions and/or insertions, preferably such thatthe 
immunogenic^ of the polypeptide encoded by the variant polynucleotide ,s not 
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substantially diminished relative to a polypeptide encoded by a polynucleotide 
sequence specifically set forth herein). The term "variants'' should also be 
understood to encompasses homologous genes of xenogenic origin. 

In additional embodiments, the present invention provides 

5 polynucleotide fragments comprising or consisting of various lengths of 

contiguous stretches of sequence identical to or complementary to one or more 
of the sequences disclosed herein. For example, polynucleotides are provided 
by this invention that comprise or consist of at least about 10, 15, 20, 30, 40, 
50, 75, 100, 150, 200, 300, 400, 500 or 1000 or more contiguous nucleotides of 

1 0 one or more of the sequences disclosed herein as well as all intermediate 
lengths there between. It will be readily understood that "intermediate lengths', 
in this context, means any length between the quoted values, such as 16, 17, 
18, 19, etc.; 21, 22, 23, efc.; 30, 31, 32, efc.; 50, 51, 52, 53, etc.; 100, 101, 102, 
103, etc.; 150, 151 , 152, 153, ete.; including all integers through 200-500; 500- 

15 1 ,000, and the like. A polynucleotide sequence as described here may be 
extended at one or both ends by additional nucleotides not found in the native 
sequence. This additional sequence may consist of 1 , 2, 3, 4, 5, 6, 7, 8, 9, 1 0, 
11, 12, 13, 14, 15, 16, 17, 18, 19, or 20 nucleotides at either end of the 
disclosed sequence or at both ends of the disclosed sequence. 

20 In another embodiment of the invention, polynucleotide 

compositions are provided that are capable of hybridizing under moderate to 
high stringency conditions to a polynucleotide sequence provided herein, or a 
fragment thereof, or a complementary sequence thereof. Hybridization 
techniques are well known in the art of molecular biology. For purposes of 

25 illustration, suitable moderately stringent conditions for testing the hybridization 
of a polynucleotide of this invention with other polynucleotides include 
prewashing in a solution of 5 X SSC, 0.5% SDS, 1 .0 mM EDTA (pH 8.0); 
hybridizing at 50 Q C-60 Q C, 5 X SSC, overnight; followed by washing twice at 
65 9 C for 20 minutes with each of 2X, 0.5X and 0.2X SSC containing 0.1 % SDS. 

30 One skilled in the art will understand that the stringency of hybridization can be 
readily manipulated, such as by altering the salt content of the hybridization 
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solution and/or the temperature at which the hybridization is performed. For 
example, in another embodiment, suitable highly stringent hybridization 
conditions include those described above, with the exception that the 
temperature of hybridization is increased, e.g., to 60-65°C or 65-70°C. 
5 In certain preferred embodiments, the polynucleotides described 

above, e.g., polynucleotide variants, fragments and hybridizing sequences, 
encode polypeptides that are immunologically cross-reactive with a polypeptide 
sequence specifically set forth herein. In other preferred embodiments, such 
por/nucleotides encode polypeptides that have a level of immunogenic activfty 
1 0 of at least about 50%, preferably at least about 70%, and more preferably at 
least about 90% of that for a polypeptide sequence specifically set forth here.n. 

The polynucleotides of the present invention, or fragments 
thereof, regardless of the length of the coding sequence itself, may be 
combined with other DNA sequences, such as promoters, polyadenylation 
15 signals, additional restriction enzyme sites, multiple cloning sites, other coding 
segments, and the like, such that their overall length may vary considerably. It 
is therefore contemplated that a nucleic acid fragment of almost any length 
may be employed, with the total length preferably being limited by the ease of 
preparation and use in the intended recombinant DNA protocol. For example, 
20 illustrative polynucleotide segments with total lengths of about 10,000, about 
5000, about 3000, about 2,000, about 1 ,000, about 500, about 200, about 100, 
about 50 base pairs in length, and the like, (including all intermediate lengths) 
are contemplated to be useful in many implementations of this invention. 

When comparing polynucleotide sequences, two sequences are 
25 said to be "identical" if the sequence of nucleotides in the two sequences is the 
same when aligned for maximum correspondence, as described below. 
Comparisons between two sequences are typically performed by comparing the 
sequences over a comparison window to identify and compare local regions of 
sequence similarity. A "comparison window" as used herein, refers to a 
30 segment of at least about 20 contiguous posrtions, usually 30 to about 75, 40 to 
about 50, in which a sequence may be compared to a reference sequence of 
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the same number of contiguous positions after the two sequences are optimally 
aligned. 

Optimal alignment of sequences for comparison may be 
conducted using the Megalign program in the Lasergene suite of bioinformatics 

5 software (DNASTAR, Inc., Madison, Wl), using default parameters. This 
program embodies several alignment schemes described in the following 
references: Dayhoff, M.O. (1978) A model of evolutionary change in proteins - 
Matrices for detecting distant relationships. In Dayhoff, M.O. (ed.) Atlas of 
Protein Sequence and Structure, National Biomedical Research Foundation, 

10 Washington DC Vol. 5, Suppl. 3, pp. 345-358; Hein J. (1990) Unified Approach 
to Alignment and Phylogenes pp. 626-645 Methods in EnzymologyvoL 183, 
Academic Press, Inc., San Diego, CA; Higgins, D.G. and Sharp, P.M. (1989) 
CABIOS 5:151-153; Myers, E.W. and Muller W. (1988) CABIOS 4:11-17; 
Robinson, E.D. (1971) Comb. Theor11A05\ Santou, N. Nes, M. (1987) Mol. 

15 Biol. Evol. 4:406-425; Sneath, P.H.A. and Sokal, R.R. (1973) Numerical 
Taxonomy -the Principles and Practice of Numerical Taxonomy, Freeman 
Press, San Francisco, CA; Wilbur, W.J. and Lipman, D.J. (1983) Proc. Natl. 
Acad., Sci. USA 80:726-730. 

Alternatively, optimal alignment of sequences for comparison may 

20 be conducted by the local identity algorithm of Smith and Waterman (1 981 ) 
Add. APL Math 2:482, by the identity alignment algorithm of Needleman and 
Wunsch (1970) J. Mol. Biol. 48:443, by the search for similarity methods of 
Pearson and Lipman (1988) Proc. Natl. Acad. Sci. USA 85: 2444, by 
computerized implementations of these algorithms (GAP, BESTFIT, BLAST, 

25 FASTA, and TFASTA in the Wisconsin Genetics Software Package, Genetics 
Computer Group (GCG), 575 Science Dr., Madison, Wl), or by inspection. 

One preferred example of algorithms that are suitable for 
determining percent sequence identity and sequence similarity are the BLAST 
and BLAST 2.0 algorithms, which are described in Altschul et al. (1977) Nucl. 

30 Acids Res. 25:3389-3402 and Altschul et al. (1 990) J. Mol. Biol. 21 5:403-41 0, 
respectively. BLAST and BLAST 2.0 can be used, for example with the 
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parameters described herein, to determine percent sequence identity for the 
polynucleotides of the invention. Software for performing BLAST analyses is 
publicly available through the National Center for Biotechnology Information. In 
one illustrative example, cumulative scores can be calculated using, for 
5 nucleotide sequences, the parameters M (reward score for a pair of matching 
residues; always >0) and N (penalty score for mismatching residues; always 
<0). Extension of the word hits in each direction are halted when: the 
cumulative alignment score falls off by the quantity X from its maximum 
achieved value; the cumulative score goes to zero or below, due to the 
1 o accumulation of one or more negative-scoring residue alignments; or the end of 
either sequence is reached. The BLAST algorithm parameters W, T and X 
determine the sensitivity and speed of the alignment. The BLASTN program 
(for nucleotide sequences) uses as defaults a wordlength (W) of 1 1 , and 
expectation (E) of 1 0, and the BLOSUM62 scoring matrix (see Henikoff and 
15 Henikoff (1989) Proc. Natl. Acad. Sci. USA 89:10915) alignments, (B) of 50, 
expectation (E) of 10, M=5, N=-4 and a comparison of both strands. 

Preferably, the "percentage of sequence identity" is determined 
by comparing two optimally aligned sequences over a window of comparison of 
at least 20 positions, wherein the portion of the polynucleotide sequence in the 
20 comparison window may comprise additions or deletions (i.e., gaps) of 20 
percent or less, usually 5 to 1 5 percent, or 10 to 1 2 percent, as compared to 
the reference sequences (which does not comprise additions or deletions) for 
optimal alignment of the two sequences. The percentage is calculated by 
determining the number of positions at which the identical nucleic acid bases 
25 occurs in both sequences to yield the number of matched positions, dividing the 
number of matched positions by the total number of positions in the reference 
sequence (i.e., the window size) and multiplying the results by 1 00 to yield the 
percentage of sequence identity. 

It will be appreciated by those of ordinary skill in the art that, as a 
30 result of the degeneracy of the genetic code, there are many nucleotide 
sequences that encode a polypeptide as described herein. Some of these 
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polynucleotides bear minimal homology to the nucleotide sequence of any 
native gene. Nonetheless, polynucleotides that vary due to differences in 
codon usage are specifically contemplated by the present invention. Further, 
alleles of the genes comprising the polynucleotide sequences provided herein 
5 are within the scope of the present invention. Alleles are endogenous genes 
that are altered as a result of one or more mutations, such as deletions, 
additions and/or substitutions of nucleotides. The resulting mRNAand protein 
may, but need not, have an altered structure or function. Alleles may be 
identified using standard techniques (such as hybridization, amplification and/or 

1 0 database sequence comparison). 

Therefore, in another embodiment of the invention, a 
mutagenesis approach, such as site-specific mutagenesis, is employed for the 
preparation of immunogenic variants and/or derivatives of the polypeptides 
described herein. By this approach, specific modifications in a polypeptide 

1 5 sequence can be made through mutagenesis of the underlying polynucleotides 
that encode them. These techniques provides a straightforward approach to 
prepare and test sequence variants, for example, incorporating one or more of 
the foregoing considerations, by introducing one or more nucleotide sequence 
changes into the polynucleotide. 

20 Site-specific mutagenesis allows the production of mutants 

through the use of specific oligonucleotide sequences which encode the DNA 
sequence of the desired mutation, as well as a sufficient number of adjacent 
nucleotides, to provide a primer sequence of sufficient size and sequence 
complexity to form a stable duplex on both sides of the deletion junction being 

25 traversed. Mutations may be employed in a selected polynucleotide sequence 
to improve, alter, decrease, modify, or otherwise change the properties of the 
polynucleotide itself, and/or alter the properties, activity, composition, stability, 
or primary sequence of the encoded polypeptide. 

In certain embodiments of the present invention, the inventors 

30 contemplate the mutagenesis of the disclosed polynucleotide sequences to 
alter one or more properties of the encoded polypeptide, such as the 
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^munogenicttycapolypeptidev^olne. The techniques of MMP-M 
mU tegenesis are we»-knowr, in the art. and are wider, used to ore* vanante 
of both polypeptides and polynucleotides. Fo, example, site-spec* 
gnosis is often used to alter a spe* portion o. a DNA molecule. 
5 su*embcdiments,a P ri m ero OT prtsin fl typicallyabo U t14toab al .^ 

nucieoUdes or so in leng* to enjoyed, wHh about 5 to ab«« 10 Kerfues on 
both sides of the junction of the sequence being altered. 

As will be appreciated by those of skill m the art, slte-speaftc 
genesis techniques have often employed a phage ve*r that oca* in 

directed mutegeneas indude veCors such as the M13 phage. These phage 
are readily comrnercially-av^ and their use is genera,* well-known to 
W. sK.iL in the art. Double-sWnded plasm,* are *. rouhnet, 

15 Interest from a plasmid to a phage. k=mrfhte 
in general, site-directed mutagenesis in accordance herewrth e 
per^edbyflrstob^ningas^-s^de^veaorormeMngapartof^ 

st^ds of a double-stranded vector that includes within its sequence a DNA 
sequencer, encodesthe desired pep«de. An oligonucteoMe pHmer beanng 
» desired mutated sequence Is prepared, genera,, sy*et«y. Thrs pnmer 
isthen annealed with the singte-stranded vector, and subjected to DNA 

is formed wherein one strand encodes the onginal non-muiated sequence and 
„ Ihe second strand bearsthe deslrad mutation. Thrs l«terodup,ex*ctor . then 
used to transform appropriate cells, such as E oo« cells, and ciones are 
selected which include recombinant vectors bearing me mutated sequence 
arrangement^ ^ ^ ^ 

30 encoding DNA segments using directed mutagenesis provides a means of 
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other ways in which sequence variants of peptides and the DNA sequences 
encoding them may be obtained. For example, recombinant vectors encoding 
the desired peptide sequence may be treated with mutagenic agents, such as 
hydroxylamine, to obtain sequence variants. Specific details regarding these 
5 methods and protocols are found in the teachings of Maloy et a/ M 1 994; Segal, 
1976; Prokop and Bajpai, 1991; Kuby, 1994; and Maniatis et a/., 1982, each 
incorporated herein by reference, for that purpose. 

As used herein, the term "oligonucleotide directed mutagenesis 
procedure" refers to template-dependent processes and vector-mediated 

1 0 propagation which result in an increase in the concentration of a specific 

nucleic acid molecule relative to its initial concentration, or in an increase in the 
concentration of a detectable signal, such as amplification. As used herein, the 
term "oligonucleotide directed mutagenesis procedure" is intended to refer to a 
process that involves the template-dependent extension of a primer molecule. 

15 The term template dependent process refers to nucleic acid synthesis of an 
RNA or a DNA molecule wherein the sequence of the newly synthesized strand 
of nucleic acid is dictated by the well-known rules of complementary base 
pairing (see, for example, Watson, 1987). Typically, vector mediated 
methodologies involve the introduction of the nucleic acid fragment into a DNA 

20 or RNA vector, the clonal amplification of the vector, and the recovery of the 
amplified nucleic acid fragment. Examples of such methodologies are provided 
by U.S. Patent No. 4,237,224, specifically incorporated herein by reference in 
its entirety. 

In another approach for the production of polypeptide variants of 
25 the present invention, recursive sequence recombination, as described in U.S. 
Patent No. 5,837,458, may be employed. In this approach, iterative cycles of 
recombination and screening or selection are performed to "evolve" individual 
polynucleotide variants of the invention having, for example, enhanced 
immunogenic activity. 
30 In other embodiments of the present invention, the polynucleotide 

sequences provided herein can be advantageously used as probes or primers 
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for nucleic acid hybridization. As such, rt is contemplated that nucleic acid 
segments that comprise or consist of a sequence region of at least about a 1 5 
nucleotide long contiguous sequence that has the same sequence as, or » 
comptementary to, a 15 nucleotide long contiguous sequence disclosed here.n 
5 will find particular utility. Longer contiguous identical or complementary 

sequences, e.g., those of about 20, 30, 40, 50, 100, 200, 500, 1 000 (Including 
all intermediate lengths) and even up to full length sequences will also be of 

use in certain embodiments. 

The ability of such nucleic acid probes to specifically hybridize to 
10 a sequence of interest will enable them to be of use in detecting the presence 
of complementary sequences in a given sample. However, other uses are also 
envisioned, such as the use of the sequence information for the preparation of 
mutant species primers, or primers for use in preparing other genetic 
constructions. 

! 5 Polynucleotide molecules having sequence regions consisting of 

contiguous nucleotide stretches of 10-14, 15-20, 30, 50, or even of 100-200 
nucleotides or so (including intermediate lengths as well), identical or 
complementary to a polynucleotide sequence disclosed herein, are particularly 
contemplated as hybridization probes for use in, e.g., Southern and Northern 
20 blotting. This would allow a gene product, or fragment thereof, to be analyzed, 
both in drverse cell types and also in various bacterial cells. The total s.ze of 
fragment, as well as the size of the complementary stretch(es), will ultimately 
depend on the intended use or application of the particular nucleic acid 
segment. Smaller fragments will generally find use in hybridization 
25 embodiments, wherein the length of the contiguous complementary region may 
be varied, such as between about 15 and about 100 nucleotides, but larger 
contiguous complementarity stretches may be used, according to the length 
complementary sequences one wishes to detect. 

The use of a hybridization probe of about 15-25 nucleotides in 
30 length allows the formation of a duplex molecule that te both stable and 
selective. Molecules having contiguous complementary sequences over 
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stretches greater than 15 bases in length are generally preferred, though, in 
order to increase stability and selectivity of the hybrid, and thereby improve the 
quality and degree of specific hybrid molecules obtained. One will generally 
prefer to design nucleic acid molecules having gene-complementary stretches 

5 of 15 to 25 contiguous nucleotides, or even longer where desired. 

Hybridization probes may be selected from any portion of any of 
the sequences disclosed herein. All that is required is to review the sequences 
set forth herein, or to any continuous portion of the sequences, from about 15- 
25 nucleotides in length up to and including the full length sequence, that one 

10 wishes to utilize as a probe or primer. The choice of probe and primer 

sequences may be governed by various factors. For example, one may wish to 
employ primers from towards the termini of the total sequence. 

Small polynucleotide segments or fragments may be readily 
prepared by, for example, directly synthesizing the fragment by chemical 

15 means, as is commonly practiced using an automated oligonucleotide 
synthesizer. Also, fragments may be obtained by application of nucleic acid 
reproduction technology, such as the PCR™ technology of U.S. Patent 
4,683,202 (incorporated herein by reference), by introducing selected 
sequences into recombinant vectors for recombinant production, and by other 

20 recombinant DNA techniques generally known to those of skill in the art of 
molecular biology. 

The nucleotide sequences of the invention may be used for their 
ability to selectively form duplex molecules with complementary stretches of the 
entire gene or gene fragments of interest. Depending on the application 

25 envisioned, one will typically desire to employ varying conditions of 

hybridization to achieve varying degrees of selectivity of probe towards target 
sequence. For applications requiring high selectivity, one will typically desire to 
employ relatively stringent conditions to form the hybrids, e.g., one will select 
relatively low salt and/or high temperature conditions, such as provided by a 

30 salt concentration of from about 0.02 M to about 0.1 5 M salt at temperatures of 
from about 50°C to about 70°C. Such selective conditions tolerate little, if any, 

37 



- • a « . , ^,ioi21%2E419C12/FefcWW^ 
WQ03013431IMle^ ~ 

PCTAJS02/24917 

WO 03/013431 



mismatch between the probe and the template or target strand, and would be 
particularly suitable for isolating related sequences. 

Of course, for some applications, for example, where one desires 
to prepare mutants employing a mutant primer strand hybridized to an 
5 underlying template, less stringent (reduced stringency) hybridization cond.t.ons 
will typically be needed in order to allow formation of the heteroduplex. In 
these circumstances, one may desire to employ salt conditions such as those 
of from about 0.1 5 M to about 0.9 M salt, at temperatures ranging from about 
20°C to about 55°C. Cross-hybridizing species can thereby be readily 
1 o identified as positively hybridizing signals with respect to control hybridizations, 
in any case, it is generalh, appreciated that condrtions can be rendered more 
stringent by the addition of increasing amounts of formamide, which serves to 
destabilize the hybrid duplex in the same manner as increased temperature. 
Thus, hybridization conditions can be readily manipulated, and thus will 
15 generally be a method of choice depending on the desired results. 

According to another embodiment of the present invention, 
polynucleotide compositions comprising antisense oligonucleotides are 
provided. Antfeense oligonucleotides have been demonstrated to be effects 
and targeted inhibitors of protein synthesis, and, consequently, provide a 
20 therapeutic approach by which a disease can be treated by inhibiting the 
synthesis of proteins that contribute to the disease. The efficacy of antisense 
oligonucleotides for inhibiting protein synthesis is well established. For 
example, the synthesis of polygalacturonase and the muscarine type 2 
acetylcholine receptor are inhibited by antisense oligonucleotides directed to 
25 their respective mRNA sequences (U.S. Patent 5,739,1 19 and U.S. Patent 
5,759,829). Further, examples of antisense inhibition have been demonstrated 
with the nuclear protein cyclin, the multiple drug resistance gene (MDG1), 
,CAM-1 , E-selectin, STK-1 , striatal GABA A receptor and human EGF (Jaskulsk. 
at a/., Science. 1988 Jun 10;240(4858):1 544-6; Vasanthakumar and Ahmed, 
30 CancerCommun. 1989;1(4):225-32; Peris ef a/., Brain Res Mol Brain Res. 
1998 Jun 15;57(2):310-20; U.S. Patent 5,801 ,154; U.S. Patent 5,789,573; U.S. 
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Patent 5,718,709 and U.S. Patent 5,610,288). Antisense constructs have also 
been described that inhibit and can be used to treat a variety of abnormal 
cellular proliferations, e.g., cancer (U.S. Patent 5,747,470; U.S. Patent 
5,591,317 and U.S. Patent 5,783,683). 
5 Therefore, in certain embodiments, the present invention provides 

oligonucleotide sequences that comprise all, or a portion of, any sequence that 
is capable of specifically binding to polynucleotide sequence described herein, 
or a complement thereof. In one embodiment, the antisense oligonucleotides 
comprise DNA or derivatives thereof. In another embodiment, the 

10 oligonucleotides comprise RNA or derivatives thereof. In a third embodiment, 
the oligonucleotides are modified DNAs comprising a phosphorothioated 
modified backbone. In a fourth embodiment, the oligonucleotide sequences 
comprise peptide nucleic acids or derivatives thereof. In each case, preferred 
compositions comprise a sequence region that is complementary, and more 

15 preferably substantially-complementary, and even more preferably, completely 
complementary to one or more portions of polynucleotides disclosed herein. 
Selection of antisense compositions specific for a given gene sequence is 
based upon analysis of the chosen target sequence and determination of 
secondary structure, T m , binding energy, and relative stability. Antisense 

20 compositions may be selected based upon their relative inability to form dimers, 
hairpins, or other secondary structures that would reduce or prohibit specific 
binding to the target mRNA in a host cell. Highly preferred target regions of the 
mRNA, are those which are at or near the AUG translation initiation codon, and 
those sequences which are substantially complementary to 5' regions of the 

25 mRNA. These secondary structure analyses and target site selection 

considerations can be performed, for example, using v.4 of the OLIGO primer 
analysis software and/or the BLASTN 2.0.5 algorithm software (Altschul et a/., 
Nucleic Acids Res. 1997, 25(17):3389-402). 

The use of an antisense delivery method employing a short 

30 peptide vector, termed MPG (27 residues), is also contemplated. The MPG 
peptide contains a hydrophobic domain derived from the fusion sequence of 
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HIV g P 41 and a hydrophilic domain from the nuclear localization sequence of 
SV40 T-antigen (Morris ef a/., Nucleic Acids Res. 1997 Jul 15;25(14):2730-6). 
,t has been demonstrated that several mo.ecules of the MPG peptide coat the 
antisense oligonucleotides and can be de.K/ered into cultured mammalian cells 
5 in less than 1 hour with relatively high efficiency (90%). Further, the interaction 
with MPG strongly increases both the stability of the oligonucleotide to 
nuclease and the ability to cross the plasma membrane. 

According to another embodiment of the invention, the 
polynucleotide compositions described herein are used in the design and 
10 preparation of ribozyme molecules for inhibiting expression of the tumor 
polypeptides and proteins of the present invention in tumor cells. Ribozymes 
are RNA-protein complexes that cleave nucleic acids in a site-specific fash.on. 
Ribozymes have specific catalytic domains that possess endonuclease activrty 
«<lm and Cech, Proc Natl Acad Sd U S A. 1987 Dec;84(24):8788-92; Forster 
15 and Symons, Cell. 1987 Apr 24;49(2):21 1-20). For example, a large number ot 
ribozymes accelerate phosphoester transfer reactions with a high degree of 
specificity, often cleaving only one of several phosphoesters in an 
oligonucleotide substrate (Cech et a/., Cel.. 1981 Dec;27(3 Pt 2):487-96; Michel 
and Westhof, J Mol Biol. 1990 Dec5;216(3):585-610; Reinhold-Hurek and 
20 Shub, Nature. 1992 May 14,357(6374) : 173-6). This specificity has been 
attributed to the requirement that the substrate bind via specific base- P a.nng 
interactions to the internal guide sequence ("IGS") of the ribozyme prior to 

chemical reaction. 

Six basic varieties of naturally-occurring enzymatic RNAs are 

25 known presently. Each can catalyze the hydrolysis of RNA phosphodiester 
bonds in trans (and thus can cleave other RNA molecules) under physiolog.cal 
conditions. In general, enzymatic nucleic acids act by first binding to a target 
RNA. Such binding occurs through the target binding portion of a enzymatic 
nucleic acid which fe held in close proximity to an enzymatic portion of the 

30 molecule that acts to cleave the target RNA. Thus, the enzymatic nucle.c acrf 
first recognizes and then binds a target RNA through complementary base- 
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pairing, and once bound to the correct site, acts enzymatically to cut the target 
RNA. Strategic cleavage of such a target RNA will destroy its ability to direct 
synthesis of an encoded protein. After an enzymatic nucleic acid has bound 
and cleaved its RNA target, it is released from that RNA to search for another 
5 target and can repeatedly bind and cleave new targets. 

The enzymatic nature of a ribozyme is advantageous over many 
technologies, such as antisense technology (where a nucleic acid molecule 
simply binds to a nucleic acid target to block its translation) since the 
concentration of ribozyme necessary to affect a therapeutic treatment is lower 

1 0 than that of an antisense oligonucleotide. This advantage reflects the ability of 
the ribozyme to act enzymatically. Thus, a single ribozyme molecule is able to 
cleave many molecules of target RNA. In addition, the ribozyme is a highly 
specific inhibitor, with the specificity of inhibition depending not only on the 
base pairing mechanism of binding to the target RNA, but also on the 

15 mechanism of target RNA cleavage. Single mismatches, or base-substitutions, 
near the site of cleavage can completely eliminate catalytic activity of a 
ribozyme. Similar mismatches in antisense molecules do not prevent their 
action (Woolf et a/., Proc Natl Acad Sci USA. 1992 Aug 15;89(16):7305-9). 
Thus, the specificity of action of a ribozyme is greater than that of an antisense 

20 oligonucleotide binding the same RNA site. 

The enzymatic nucleic acid molecule may be formed in a 
hammerhead, hairpin, a hepatitis 5 virus, group I intron or RNaseP RNA (in 
association with an RNA guide sequence) or Neurospora VS RNA motif. 
Examples of hammerhead motifs are described by Rossi et a/., Nucleic Acids 

25 Res. 1 992 Sep 1 1 ;20(1 7):4559-65. Examples of hairpin motifs are described 
by Hampel et at. (Eur. Pat Appl. Publ. No. EP 0360257), Hampel and Tritz, 
Biochemistry 1989 Jun 13;28(12):4929-33; Hampel et a/., Nucleic Acids Res. 
1990 Jan 25;18(2):299-304 and U.S. Patent 5,631 ,359. An example of the 
hepatitis 8 virus motif is described by Perrotta and Been, Biochemistry. 1992 

30 Dec 1 ;31 (47):1 1 843-52; an example of the RNaseP motif is described by 

Guerrier-Takada et a/., Cell. 1983 Dec;35(3 Pt 2):849-57; Neurospora VS RNA 
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ribozyme motif is described by Collins (Saville and Collins, Cel.. 1990 May 
1 8 ;61 (4):685-96; Saville and Collins, Proc Natl Acad Sci USA.1 991 Oct 
1;88(19):8826-30; Collins and Olive, Biochemistry. 1993 Mar 23;32(11):2795- 
gy and an example of the Group I intron Is described in (U.S. Patent 
5 4 987 071). All that is important in an enzymatic nucleic acid molecule of this 
invention is that it has a specific substrate binding site which is complementary 
to one or more of the target gene RNA regions, and that It have nucleotide 
sequences within or surrounding that substrate binding alto which impart an 
RNA cleaving activity to the molecule. Thus the ribozyme constructs need not 
1 o be limited to specific motifs mentioned herein. 

Ribozymes may be designed as described in Int. Pat. Appl. PubL 
No WO 93/23569 and Int. Pat. Appl. Publ. No. WO 94/02595, each specifically 
incorporated herein by reference) and synthesized to be tested in vitro and ,n 
vivo as described. Such ribozymes can also be optimfeed for delivery. Wh.le 
15 specrficexamp.es are provided, those in the art will recognize that equivalent 
RNA targets in other species can be utilized when necessary. 

Ribozyme activity can be optimized by altering the length of the 
ribozyme binding arms, or chemically synthesizing ribozymes with modifications 
that prevent their degradation by serum ribonucleases (see e.g., Int. Pat. Appl. 
20 Publ No. WO 92/07065; Int. Pat. Appl. Publ. No. WO 93/15187; Int. Pat. Appl. 
Publ No WO 91/03162; Eur. Pat. Appl. Publ. No. 921 10298.4; U.S. Patent 
5 334,711; and Int. Pat. Appl. Publ. No. WO 94/13688, which describe various 
chemical medications that can be made to the sugar moieties of enzymafc 
RNA molecules), modifications which enhance their efficacy in cells, and 
25 removal of stem II bases to shorten RNAsynthesis times and reduce chem.cal 
requirements. 

Sullivan etal. (Int. Pat. Appl. Publ. No. WO 94/02595) describes 
the general methods for delivery of enzymatic RNA molecules. Ribozymes 
may be administered to cells by a variety of methods known to those famihar to 
30 the art, including, but not restricted to, encapsulation in liposomes, by 
iontophoresis, or by incorporation into other vehicles, such as hydrogels, 
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cyclodextrins, biodegradable nanocapsules, and bioadhesive microspheres. 
For some indications, ribozymes may be directly delivered ex vivo to cells or 
tissues with or without the aforementioned vehicles. Alternatively, the 
RNA/vehicle combination may be locally delivered by direct inhalation, by direct 
5 injection or by use of a catheter, infusion pump or stent Other routes of 
delivery include, but are not limited to, intravascular, intramuscular, 
subcutaneous or joint injection, aerosol inhalation, oral (tablet or pill form), 
topical, systemic, ocular, intraperitoneal and/or intrathecal delivery. More 
detailed descriptions of ribozyme delivery and administration are provided in 

10 Int. Pat. Appl. Publ. No. WO 94/02595 and Int. Pat. Appl. Publ. No. WO 
93/23569, each specifically incorporated herein by reference. 

Another means of accumulating high concentrations of a 
ribozyme(s) within cells is to incorporate the ribozyme-encoding sequences into 
a DNA expression vector. Transcription of the ribozyme sequences are driven 

15 from a promoter for eukaryotic RNA polymerase I (pol I), RNA polymerase II 
(pol II), or RNA polymerase III (pol III). Transcripts from pol II or pol III 
promoters will be expressed at high levels in all cells; the levels of a given pol II 
promoter in a given cell type will depend on the nature of the gene regulatory 
sequences (enhancers, silencers, ete.) present nearby. Prokaryotic RNA 

20 polymerase promoters may also be used, providing that the prokaryotic RNA 
polymerase enzyme is expressed in the appropriate cells Ribozymes 
expressed from such promoters have been shown to function in mammalian 
cells. Such transcription units can be incorporated into a variety of vectors for 
introduction into mammalian cells, including but not restricted to, plasmid DNA 

25 vectors, viral DNA vectors (such as adenovirus or adeno-associated vectors), 
or viral RNA vectors (such as retroviral, semliki forest virus, sindbis virus 
vectors). 

In another embodiment of the invention, peptide nucleic acids 
(PNAs) compositions are provided. PNA is a DNA mimic in which the 
30 nucleobases are attached to a pseudopeptide backbone (Good and Nielsen, 
Antisense Nucleic Acid Drug Dev. 1997 7(4) 431-37). PNA is able to be utilized 
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in a number methods that traditionally have used RNA or DNA. Often PNA 
sequences perform better in techniques than the corresponding RNA or DNA 
sequences and have utilities that are not inherent to RNA or DNA. A review of 
PNA including methods of making, characteristics of, and methods of using, is 
5 P rovidedbyCorey(TfBndse/ofecf)no/1997Jun;15(6):224-9). Assuch.in 
certain embodiments, one may prepare PNA sequences that are 
complementary to one or more portions of the ACE mRNA sequence, and such 
PNA compositions may be used to regulate, alter, decrease, or reduce the 
translation of ACE-specific mRNA, and thereby alter the level of ACE activity in 
10 a host cell to which such PNA compositions have been administered. 

PNAs have 2-aminoethyl-glycine linkages replacing the normal 
phosphodiester backbone of DNA (Nielsen et al., Science 1991 Dec 
6;254(5037):1497-500; Hanvey etal., Science. 1992 Nov 27;258 (5087): 1481- 
5; Hyrup and Nielsen, Bioorg Med Chem. 1996 Jan;4(1):5-23). This chemistry 
1 5 has three important consequences: firstly, in contrast to DNA or 

phosphorothioate oligonucleotides, PNAs are neutral molecules; secondly, 
PNAs are achiral, which avoids the need to develop a stereoselective 
synthesis; and thirdly, PNA synthesis uses standard Boc or Fmoc protocols for 
solid-phase peptide synthesis, although other methods, including a modified 
20 Merrifield method, have been used. 

PNA monomers or ready-made oligomers are commercially 
available from PerSeptive Biosystems (Framingham, MA). PNA syntheses by 
either Boc or Fmoc protocols are straightforward using manual or automated 
protocols (Norton et al., Bioorg Med Chem. 1995 Apr;3(4):437-45). The 
25 manual protocol lends itself to the production of chemically modified PNAs or 
the simultaneous synthesis of families of closely related PNAs. 

As with peptide synthesis, the success of a particular PNA 
synthesis will depend on the properties of the chosen sequence. For example, 
while in theory PNAs can incorporate any combination of nucleotide bases, the 
30 presence of adjacent purines can lead to deletions of one or more residues in 
the product. In expectation of this difficulty, it is suggested that, in producing 
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PNAs with adjacent purines, one should repeat the coupling of residues likely 
to be added inefficiently. This should be followed by the purification of PNAs 
by reverse-phase high-pressure liquid chromatography, providing yields and 
purity of product similar to those observed during the synthesis of peptides. 
5 Modifications of PNAs for a given application may be 

accomplished by coupling amino acids during solid-phase synthesis or by 
attaching compounds that contain a carboxylic acid group to the exposed N- 
terminal amine. Alternatively, PNAs can be modified after synthesis by 
coupling to an introduced lysine or cysteine. The ease with which PNAs can be 

1 0 modified facilitates optimization for better solubility or for specific functional 
requirements. Once synthesized, the identity of PNAs and their derivatives can 
be confirmed by mass spectrometry. Several studies have made and utilized 
modifications of PNAs (for example, Norton et al., Bioorg Med Chem. 1995 
Apr;3(4):437-45; Petersen et al., J Pept Sci. 1995 May-Jun;1 (3):1 75-83; Orum 

1 5 et al., Biotechniques. 1 995 Sep;1 9(3):472-80; Footer et al., Biochemistry. 1996. 
Aug 20;35(33):1 0673-9; Griffith et al., Nucleic Acids Res. 1995 Aug 
1 1 ;23(15):3003-8; Pardridge et al., Proc Natl Acad Sci USA. 1995 Jun 
6;92(1 2):5592-6; Boffa ef al., Proc Natl Acad Sci U S A. 1 995 Mar 
14;92(6):1901-5; Gambacorti-Passerini etal., Blood. 1996 Aug 15;88(4):1411- 

20 7; Armitage et al., Proc Natl Acad Sci U S A. 1997 Nov 1 1 ;94(23):12320-5; 
Seeger et al., Biotechniques. 1997 Sep;23(3):512-7). U.S. Patent No. 
5,700,922 discusses PNA-DNA-PNA chimeric molecules and their uses in 
diagnostics, modulating protein in organisms, and treatment of conditions 
susceptible to therapeutics. 

25 Methods of characterizing the antisense binding properties of 

PNAs are discussed in Rose (Anal Chem. 1993 Dec 15;65(24):3545-9) and 
Jensen et al. (Biochemistry. 1 997 Apr 22;36(1 6):5072-7). Rose uses capillary 
gel electrophoresis to determine binding of PNAs to their complementary 
oligonucleotide, measuring the relative binding kinetics and stoichiometry. 

30 Similar types of measurements were made by Jensen et al. using BIAcore™ 
technology. 
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Other applications of PNAs that have been described and will be 
apparent to the skilled artisan include use in DNA strand invasion, antisense 
inhibition, mutational analysis, enhancers of transcription, nucleic acid 
purif ication, isolation of transcriptionally active genes, blocking of transcription 
5 factor binding, genome cleavage, biosensors, in situ hybridization, and the like. 

pp. vm. mi prrnDE Identification. Charactfri7ation a nd Expression 

Polynucleotides compositions of the present invention may be 
identified, prepared and/or manipulated using any of a variety of well 
established techniques (see generally, Sambrook et al., Molecular Cloning: A 
1 o Laboratory Manual, Cold Spring Harbor Laboratories, Cold Spring Harbor, NY, 
1989, and other like references). For example, a polynucleotide may be 
identified, as described in more detail below, by screening a microarray of 
cDN As for tumor-associated expression (i.e., expression that is at least two fold 
greater in a tumor than in normal tissue, as determined using a representative 

1 5 assay provided herein). Such screens may be performed, for example, using 
the microarray technology of Affymetrix, Inc. (Santa Clara, CA) according to the 
manufacturer's instructions (and essentially as described by Schena et al., 
Proc. Natl. Acad. Sci. USA 93:10614-10619, 1996 and Heller et al., Proc. Natl. 
Acad. Sci. USA 94:2150-2155, 1997). Alternatively, polynucleotides may be 

20 amplified from cDNA prepared from cells expressing the proteins described 

herein, such as tumor cells. 

Many template dependent processes are available to amplify a 
target sequences of interest present in a sample. One of the best known 
amplification methods is the polymerase chain reaction (PCR W ) which is 

25 described in detail in U.S. Patent Nos. 4,683,195, 4,683,202 and 4,800,159, 
each of which is incorporated herein by reference in its entirety. Briefly, in 
PCR™ , two primer sequences are prepared which are complementary to 
regions on opposite complementary strands of the target sequence. An excess 
of deoxynucleoside triphosphates is added to a reaction mixture along with a 

30 DNA polymerase (e.g., Taq polymerase). If the target sequence is present in a 
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sample, the primers will bind to the target and the polymerase will cause the 
primers to be extended along the target sequence by adding on nucleotides. 
By raising and lowering the temperature of the reaction mixture, the extended 
primers will dissociate from the target to form reaction products, excess primers 
5 will bind to the target and to the reaction product and the process is repeated. 
Preferably reverse transcription and PCR™ amplification procedure may be 
performed in order to quantify the amount of mRNA amplified. Polymerase 
chain reaction methodologies are well known in the art. 

Any of a number of other template dependent processes, many of 

1 0 which are variations of the PCR ™ amplification technique, are readily known 
and available in the art Illustratively, some such methods include the ligase 
chain reaction (referred to as LCR), described, for example, in Eur. Pat, Appl. 
Publ. No. 320,308 and U.S. Patent No. 4,883,750; Qbeta Replicase, described 
in PCT Intl. Pat. Appl. Publ. No. PCT/US87/00880; Strand Displacement 

15 Amplification (SDA) and Repair Chain Reaction (RCR). Still other amplification 
methods are described In Great Britain Pat. Appl. No. 2 202 328, and in PCT 
Intl. Pat. Appl. Publ. No. PCT/US89/01025. Other nucleic acid amplification 
procedures include transcription-based amplification systems (TAS) (PCT Intl. 
Pat. Appl. Publ. No. WO 88/10315), including nucleic acid sequence based 

20 amplification (NASBA) and 3SR. Eur. Pat. Appl. Publ. No. 329,822 describes a 
nucleic acid amplification process involving cyclically synthesizing 
single-stranded RNA ("ssRNA"), ssDNA, and double-stranded DNA (dsDNA). 
PCT Intl. Pat. Appl. Publ. No. WO 89/06700 describes a nucleic acid sequence 
amplification scheme based on the hybridization of a promoter/primer 

25 sequence to a target single-stranded DNA ("ssDNA") followed by transcription 
of many RNA copies of the sequence. Other amplification methods such as 
"RACE" (Frohman, 1990), and "one-sided PCR" (Ohara, 1989) are also well- 
known to those of skill in the art. 

An amplified portion of a polynucleotide of the present invention 

30 may be used to isolate a full length gene from a suitable library (e.g., a tumor 
cDNA library) using well known techniques. Within such techniques, a library 
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(cDNA or genomic) is screened using one or more polynucleotide probes or 
primers suitable for amplification. Preferably, a library is size-selected to 
include larger molecules. Random primed libraries may also be preferred for 
identifying 5' and upstream regions of genes. Genomic libraries are preferred 
5 for obtaining introns and extending 5' sequences. 

For hybridization techniques, a partial sequence may be labeled 
(e.g., by nick-translation or end-labeling with *P) using well known techniques. 
A bacterial or bacteriophage library is then generally screened by hybridizing 
filters containing denatured bacterial colonies (or lawns containing phage 
1 0 plaques) with the labeled probe (see Sambrook et al., Molecular Cloning: A 
Laboratory Manual, Cold Spring Harbor Laboratories, Cold Spring Harbor, NY, 
1989). Hybridizing colonies or plaques are selected and expanded, and the 
DNA is isolated for further analysis. cDNA clones may be analyzed to 
determine the amount of additional sequence by, for example, PCR using a 
1 5 primer from the partial sequence and a primer from the vector. Restriction 
maps and partial sequences may be generated to identify one or more 
overlapping clones. The complete sequence may then be determined using 
standard techniques, which may involve generating a series of deletion clones. 
The resulting overlapping sequences can then assembled into a single 
20 contiguous sequence. A full length cDNA molecule can be generated by 
ligating suitable fragments, using well known techniques. 

Alternatively, amplification techniques, such as those described 
above, can be useful for obtaining a full length coding sequence from a partial 
cDNA sequence. One such amplification technique is inverse PCR (see Triglia 
25 et al., Nucl. Acids Res. 76:81 86, 1988), which uses restriction enzymes to 
generate a fragment in the known region of the gene. The fragment is then 
circularized by intramolecular ligation and used as a template for PCR with 
divergent primers derived from the known region. Within an alternative 
approach, sequences adjacent to a partial sequence may be retrieved by 
30 amplification with a primer to a linker sequence and a primer specific to a 
known region. The amplified sequences are typically subjected to a second 
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round of amplification with the same linker primer and a second primer specific 
to the known region. A variation on this procedure, which employs two primers 
that initiate extension in opposite directions from the known sequence, is 
described in WO 96/38591 . Another such technique is known as "rapid 
5 amplification of cDNA ends" or RACE. This technique involves the use of an 
internal primer and an external primer, which hybridizes to a polyA region or 
vector sequence, to identify sequences that are 5' and 3' of a known sequence. 
Additional techniques include capture PCR (Lagerstrom et al., PCR Methods 
Applic. 7:111-19, 1991) and walking PCR (Parker et al., Nucl. Acids. Res. 

1 0 79:3055-60, 1 991 ). Other methods employing amplification may also be 
employed to obtain a full length cDNA sequence. 

In certain instances, it is possible to obtain a full length cDNA 
sequence by analysis of sequences provided in an expressed sequence tag 
(EST) database, such as that available from GenBank. Searches for 

1 5 overlapping ESTs may generally be performed using well known programs 
{e.g., NCBI BLAST searches), and such ESTs may be used to generate a 
contiguous full length sequence. Full length DNA sequences may also be 
obtained by analysis of genomic fragments. 

In other embodiments of the invention, polynucleotide sequences 

20 or fragm ents thereof which encode polypeptides of the invention , or fusion 
proteins or functional equivalents thereof, may be used in recombinant DNA 
molecules to direct expression of a polypeptide in appropriate host cells. Due 
to the inherent degeneracy of the genetic code, other DNA sequences that 
encode substantially the same or a functionally equivalent amino acid 

25 sequence may be produced and these sequences may be used to clone and 
express a given polypeptide. 

As will be understood by those of skill in the art, it may be 
advantageous in some instances to produce polypeptide-encoding nucleotide 
sequences possessing non-naturally occurring codons. For example, codons 

30 preferred by a particular prokaryotic or eukaryotic host can be selected to 
increase the rate of protein expression or to produce a recombinant RNA 
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transcript having desirable properties, such as a half-life which is longer than 
that of a transcript generated from the naturally occurring sequence. 

Moreover, the polynucleotide sequences of the present invention 
can be engineered using methods generally known in the art in order to alter 

5 polypeptide encoding sequences for a variety of reasons, including but not 
limited to, alterations which modify the cloning, processing, and/or expression 
of the gene product. For example, DNA shuffling by random fragmentation and 
PCR reassembly of gene fragments and synthetic oligonucleotides may be 
used to engineer the nucleotide sequences. In addition, site-directed 

10 mutagenesis may be used to insert new restriction sites, alter glycosylation 
patterns, change codon preference, produce splice variants, or introduce 

mutations, and so forth. 

In another embodiment of the invention, natural, modified, or 
recombinant nucleic acid sequences may be ligated to a heterologous 

1 5 sequence to encode a fusion protein. For example, to screen peptide libraries 
for inhibitors of polypeptide activity, it may be useful to encode a chimeric 
protein that can be recognized by a commercially available antibody. A fusion 
protein may also be engineered to contain a cleavage site located between the 
polypeptide-encoding sequence and the heterologous protein sequence, so 

20 that the polypeptide may be cleaved and purified away from the heterologous 
moiety. 

Sequences encoding a desired polypeptide may be synthesized, 
in whole or in part, using chemical methods well known in the art (see 
Caruthers, M. H. et al. (1980) Nucl. Acids Res. Symp. Ser. 215-223, Horn, T. et 

25 al. (1980) Nucl. Acids Res. Symp. Ser. 225-232). Alternatively, the protein itself 
may be produced using chemical methods to synthesize the amino acid 
sequence of a polypeptide, or a portion thereof. For example, peptide 
synthesis can be performed using various solid-phase techniques (Roberge, J. 
Y. et al. (1995) Science 269:202-204) and automated synthesis may be 

30 achieved, for example, using the ABI 431 A Peptide Synthesizer (Perkin Elmer, 
Palo Alto, CA). 
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A newly synthesized peptide may be substantially purified by 
preparative high performance liquid chromatography (e.g., Creighton, T. (1983) 
Proteins, Structures and Molecular Principles, WH Freeman and Co., New 
York, N.Y.) or other comparable techniques available in the art. The 
5 composition of the synthetic peptides may be confirmed by amino acid analysis 
or sequencing (e.g., the Edman degradation procedure). Additionally, the 
amino acid sequence of a polypeptide, or any part thereof, may be altered 
during direct synthesis and/or combined using chemical methods with 
sequences from other proteins, or any part thereof, to produce a variant 
10 polypeptide. 

In order to express a desired polypeptide, the nucleotide 
sequences encoding the polypeptide, or functional equivalents, may be 
inserted into appropriate expression vector, /.e., a vector which contains the 
necessary elements for the transcription and translation of the inserted coding 

1 5 sequence. Methods which are well known to those skilled in the art may be 
used to construct expression vectors containing sequences encoding a 
polypeptide of interest and appropriate transcriptional and translation^ control 
elements. These methods include in vitro recombinant DNA techniques, 
synthetic techniques, and in vivo genetic recombination. Such techniques are 

20 described, for example, in Sambrook, J. et al. (1989) Molecular Cloning, A 

Laboratory Manual, Cold Spring Harbor Press, Plainview, N.Y., and Ausubel, F. 
M. et al. (1989) Current Protocols in Molecular Biology, John Wiley & Sons, 
New York. N.Y. 

A variety of expression vector/host systems may be utilized to 
25 contain and express polynucleotide sequences. These include, but are not 
limited to, microorganisms such as bacteria transformed with recombinant 
bacteriophage, plasmid, or cosmid DNA expression vectors; yeast transformed 
with yeast expression vectors; insect cell systems infected with virus expression 
vectors (e.g., baculovirus); plant cell systems transformed with virus expression 
30 vectors (e.g., cauliflower mosaic virus, CaMV; tobacco mosaic virus, TMV) or 
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with bacterial expression vectors (e.g., Ti or pBR322 plasmids); or animal cell 
systems. 

The "control elements" or "regulatory sequences" present in an 
expression vector are those non-translated regions of the vector-enhancers, 
5 promoters, 5' and 3' untranslated regions-which interact with host cellular 
proteins to carry out transcription and translation. Such elements may vary in 
their strength and specificity. Depending on the vector system and host 
utilized, any number of suitable transcription and translation elements, including 
constitutive and inducible promoters, may be used. For example, when cloning 
10 in bacterial systems, inducible promoters such as the hybrid lacZ promoter of 
the pBLUESCRIPT phagemid (Stratagene, La Jolla, Calif.) or pSPORTI 
plasmid (Gibco BRL, Gaithersburg, MD) and the like may be used. In 
mammalian cell systems, promoters from mammalian genes or from 
mammalian viruses are generally preferred. If it is necessary to generate a cell 
1 5 line that contains multiple copies of the sequence encoding a polypeptide, 
vectors based on SV40 or EBV may be advantageously used with an 
appropriate selectable marker. 

In bacterial systems, any of a number of expression vectors may 
be selected depending upon the use intended for the expressed polypeptide. 
20 For example, when large quantities are needed, for example for the induction of 
antibodies, vectors which direct high level expression of fusion proteins that are 
readily purified may be used. Such vectors include, but are not limited to, the 
multifunctional E. coli cloning and expression vectors such as pBLUESCRIPT 
(Stratagene), in which the sequence encoding the polypeptide of interest may 
25 be ligated into the vector in frame with sequences for the amino-terminal Met 
and the subsequent 7 residues of .beta.-galactosidase so that a hybrid protein 
is produced; pIN vectors (Van Heeke, G. and S. M. Schuster (1989) J. Biol. 
Chem. 264:5503-5509); and the like. pGEX Vectors (Promega, Madison, Wis.) 
may also be used to express foreign polypeptides as fusion proteins with 
30 glutathione S-transferase (GST). In general, such fusion proteins are soluble 
and can easily be purified from lysed cells by adsorption to glutathione-agarose 
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beads followed by elution in the presence of free glutathione. Proteins made in 
such systems may be designed to include heparin, thrombin, or factor XA 
protease cleavage sites so that the cloned polypeptide of interest can be 
released from the GST moiety at will. 
5 In the yeast, Saccharomyces cerevisiae, a number of vectors 

containing constitutive or inducible promoters such as alpha factor, alcohol 
oxidase, and PGH may be used. For reviews, see Ausubel et al. (supra) and 
Grant et al. (1 987) Methods EnzymoL 1 53:51 6-544. 

In cases where plant expression vectors are used, the expression 

1 0 of sequences encoding polypeptides may be driven by any of a number of 
promoters. For example, viral promoters such as the 35S and 19S promoters 
of CaMV may be used alone or in combination with the omega leader 
sequence from TMV (Takamatsu, N. (1 987) EMBO J. 6:307-31 1 . Alternatively, 
plant promoters such as the small subunit of RUBISCO or heat shock 

1 5 promoters may be used (Coruzzi, G. et al. (1 984) EMBO J. 3:1 671 -1 680; 
Broglie, R. et al. (1984) Science 224:838-843; and Winter, J. et al. (1991) 
Results Probi Cell Differ. 1 7: 85-1 05). These constructs can be introduced into 
plant cells by direct DNA transformation or pathogen-mediated transfection. 
Such techniques are described in a number of generally available reviews (see, 

20 for example, Hobbs, S. or Murry, L E. in McGraw Hill Yearbook of Science and 
Technology (1992) McGraw Hill, New York, N.Y.; pp. 191-196). 

An insect system may also be used to express a polypeptide of 
interest. For example, in one such system, Autographa californica nuclear 
polyhedrosis virus (AcNPV) is used as a vector to express foreign genes in 

25 Spodoptera f rugiperda cells or in Trichoplusia larvae. The sequences encoding 
the polypeptide may be cloned into a non-essential region of the virus, such as 
the polyhedrin gene, and placed under control of the polyhedrin promoter. 
Successful insertion of the polypeptide-encoding sequence will render the 
polyhedrin gene inactive and produce recombinant virus lacking coat protein. 

30 The recombinant viruses may then be used to infect, for example, S. frugiperda 
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cells orTrichoplusia larvae in which the polypeptide of interest may be 
expressed (Engelhard, E. K. et al. (1994) Proc. Nat,. Acad. Sci. 91 :3224-3227). 

In mammalian host cells, a number of viral-based express.on 
systems are generally available. For example, in cases where an adenovirus is 
5 used as an expression vector, sequences encoding a polypeptide of interest 
maybe ligated into an adenovirus transcriptionAranslation complex consisting 
of the late promoter and tripartite leader sequence. Insertion in a non-essent.al 
E1 or E3 region of the viral genome may be used to obtain a viable virus wh.ch 
is capable of expressing the polypeptide in infected host cells (Logan, J. and 
10 Shenk, T. (1984) Proc. Natl. Acad. Sci. 81:3655-3659). In addition, 

transcription enhancers, such as the Rous sarcoma virus (RSV) enhancer, may 
be used to increase expression in mammalian host cells. 

Specific initiation signals may also be used to achieve more 
efficient translation of sequences encoding a polypeptide of interest. Such 
15 signals include the ATG initiation codon and adjacent sequences. In cases 
where sequences encoding the polypeptide, its initiation codon, and upstream 
sequences are inserted into the appropriate expression vector, no additional 
transcriptional or translations control signals may be needed. However, in 
cases where onry coding sequence, or a portion thereof, is inserted, exogenous 
20 translational control signals including the ATG initiation codon should be 
provided. Furthermore, the initiation codon should be in the correct reading 
frame to ensure translation of the entire insert. Exogenous translational 
elements and initiation codons may be of various origins, both natural and 
synthetic. The efficiency of expression may be enhanced by the inclus.on of 
25 enhancers which are appropriate for the particular cell system whteh is used, 
such as those described in the Irterature (Scharf, D. et al. (1994) Results Probl. 

Cell Differ. 20:125-162). 

In addition, a host cell strain may be chosen for its ability to 
modulate the expression of the inserted sequences or to process the 
30 expressed protein in the desired fashion. Such modifications of the polypept.de 
include, but are not limited to, acetylation, carboxylation, glycosylation, 
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phosphorylation, lipidation, and acylation. Post-translational processing which 
cleaves a °prepro D form of the protein may also be used to facilitate correct 
insertion, folding and/or function. Different host cells such as CHO, COS, 
HeLa, MDCK, HEK293, and WI38, which have specific cellular machinery and 
5 characteristic mechanisms for such post-translational activities, may be chosen 
to ensure the correct modification and processing of the foreign protein. 

For long-term, high-yield production of recombinant proteins, 
stable expression is generally preferred. For example, cell lines which stably 
express a polynucleotide of interest may be transformed using expression 

1 0 vectors which may contain viral origins of replication and/or endogenous 
expression elements and a selectable marker gene on the same or on a 
separate vector. Following the introduction of the vector, cells may be allowed 
to grow for 1-2 days in an enriched media before they are switched to selective 
media. The purpose of the selectable marker is to confer resistance to 

1 5 selection, and its presence allows growth and recovery of cells which 

successfully express the introduced sequences. Resistant clones of stably 
transformed cells may be proliferated using tissue cuiture techniques 
appropriate to the cell type. 

Any number of selection systems may be used to recover 

20 transformed cell lines. These include, but are not limited to, the herpes simplex 
virus thymidine kinase (Wigler, M. et al. (1977) Cell 11:223-32) and adenine 
phosphoribosyltransferase (Lowy, I. et al. (1990) Ce// 22:81 7-23) genes which 
can be employed in tk.sup.- or apitsup.- cells, respectively. Also, 
antimetabolite, antibiotic or herbicide resistance can be used as the basis for 

25 selection; for example, dhfr which confers resistance to methotrexate (Wigler, 
M. et al. (1980) Proc. Natl. Acad. Sci 77:3567-70); npt, which confers 
resistance to the aminoglycosides, neomycin and G-418 (Colbere-Garapin, F. 
et al (1981) J. MoL Biol. 1 50:1-14); and als or pat, which confer resistance to 
chlorsulfuron and phosphinotricin acetyltransferase, respectively (Murry, supra). 

30 Additional selectable genes have been described, for example, trpB, which 
allows cells to utilize indole in place of tryptophan, or hisD, which allows cells to 
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utilize histinol in place of histidine (Hartman, S. C. and R. C. Mulligan (1988) 
Proc. Natl. Acad. Sci. 85:8047-51). The use of visible markers has gained 
popularity with such markers as arrthocyanins, beta-glucuronidase and its 
substrate GUS, and luciferase and its substrate luciferin, being widely used not 
5 only to identify transformants, but also to quantify the amount of transient or 
stable protein expression attributable to a specific vector system (Rhodes, C. A. 
etal. (1995) Methods Mol. Biol. 55:121-131). 

Although the presence/absence of marker gene expression 
suggests that the gene of interest is also present, its presence and expression 
1 o may need to be confirmed. For example, if the sequence encoding a 
polypeptide is inserted within a marker gene sequence, recombinant cells 
containing sequences can be identified by the absence of marker gene 
function. Alternatively, a marker gene can be placed in tandem with a 
polypeptide-encoding sequence under the control of a single promoter. 
15 Egression of the marker gene in response to induction or selection usually - 

indicates expression of the tandem gene as well. 

Alternatively, host cells that contain and express a desired 
polynucleotide sequence may be identified by a variety of procedures known to 
those of skill in the art. These procedures include, but are not limited to, DNA- 
20 DNA or DNA-RNA hybridizations and protein bioassay or immunoassay 
techniques which include, for example, membrane, solution, or chip based 
technologies for the detection and/or quantification of nucleic acid or protein. 

A variety of protocols for detecting and measuring the expression 
of polynucleotide-encoded products, using either polyclonal or monoclonal 
25 antibodies specific for the product are known in the art. Examples include 
enzyme-linked immunosorbent assay (ELISA), radioimmunoassay (RIA), and 
fluorescence activated cell sorting (FACS). A two-site, monoclonal-based 
immunoassay utilizing monoclonal antibodies reactive to two non-interfering 
epitopes on a given polypeptide may be preferred for some applications, but a 
30 competitive binding assay may also be employed. These and other assays are 
described, among other places, in Hampton, R. et al. (1990; Serological 
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Methods, a Laboratory Manual, APS Press, St Paul. Minn.) and Maddox, D. E. 
et al. (1983; J. Exp. Med. 158:1 21 1-1216). 

A wide variety of labels and conjugation techniques are known by 
those skilled in the art and may be used in various nucleic acid and amino acid 
5 assays. Means for producing labeled hybridization or PCR probes for detecting 
sequences related to polynucleotides include oligolabeling, nick translation, 
end-labeling or PCR amplification using a labeled nucleotide. Alternatively, the 
sequences, or any portions thereof may be cloned into a vector for the 
production of an mRNA probe. Such vectors are known in the art, are 

1 0 commercially available, and may be used to synthesize RNA probes in vitro by 
addition of an appropriate RNA polymerase such as T7, T3, or SP6 and labeled 
nucleotides. These procedures may be conducted using a variety of 
commercially available kits. Suitable reporter molecules or labels, which may 
be used include radionuclides, enzymes, fluorescent, chemiluminescent, or 

15 chromogenic agents as well as substrates, cefaclors, inhibitors, magnetic 
particles, and the like. 

Host cells transformed with a polynucleotide sequence of interest 
may be cultured under conditions suitable for the expression and recovery of 
the protein from cell culture. The protein produced by a recombinant cell may 

20 be secreted or contained intracellularly depending on the sequence and/or the 
vector used. As will be understood by those of skill in the art, expression 
vectors containing polynucleotides of the invention may be designed to contain 
signal sequences which direct secretion of the encoded polypeptide through a 
prokaryotic or eukaryotic cell membrane. Other recombinant constructions may 

25 be used to join sequences encoding a polypeptide of interest to nucleotide 
sequence encoding a polypeptide domain which will facilitate purification of 
soluble proteins. Such purification facilitating domains include, but are not 
limited to, metal chelating peptides such as histidine-tryptophan modules that 
allow purification on immobilized metals, protein A domains that allow 

30 purification on immobilized immunoglobulin, and the domain utilized in the 
FLAGS extension/affinity purification system (Immunex Corp., Seattle, Wash.). 
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The inclusion of cleavable linker sequences such as those specific for Factor 
XA or enterokinase (Invitrogen, San Diego, Calif.) between the purification 
domain and the encoded polypeptide may be used to facilitate purification. 
One such expression vector provides for expression of a fusion protein 

5 containing a polypeptide of interest and a nucleic acid encoding 6 histidine 
residues preceding a thioredoxin or an enterokinase cleavage site. The 
histidine residues facilitate purification on IMIAC (immobilized metal ion affinity 
chromatography) as described in Porath, J. et al. (1992, Prot. Exp. Purif. 3:263- 
281) while the enterokinase cleavage site provides a means for purifying the 

1 0 desired polypeptide from the fusion protein. A discussion of vectors which 
contain fusion proteins is provided in Kroll, D. J. et al. (1993; DNA Cell Biol. 
1 2:441 -453). 

In addition to recombinant production methods, polypeptides of 
the invention, and fragments thereof, may be produced by direct peptide 

15 synthesis using solid-phase techniques (Merrifield J. (1963) J. Am. Chem. Soc. 
85:2149-2154). Protein synthesis may be performed using manual techniques 
or by automation. Automated synthesis may be achieved, for example, using 
Applied Biosystems431A Peptide Synthesizer (Perkin Elmer). Alternatively, 
various fragments may be chemically synthesized separately and combined 

20 using chemical methods to produce the full length molecule. 

ANTIBODY COMPOSITIONS. FRAGMFNTS THEREOF AND OTHER BINDIN G AGENTS 

According to another aspect, the present invention further 
provides binding agents, such as antibodies and antigen-binding fragments 
thereof, that exhibit immunological binding to a tumor polypeptide disclosed 

25 herein, or to a portion, variant or derivative thereof. An antibody, or antigen- 
binding fragment thereof, is said to "specifically bind," "immunogically bind," 
and/or is "immunologically reactive" to a polypeptide of the invention if it reacts 
at a detectable level (within, for example, an ELISA assay) with the 
polypeptide, and does not react detectably with unrelated polypeptides under 

30 similar conditions. 
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Immunological binding, as used in this context, generally refers to 
the non-covalent interactions of the type which occur between an 
immunoglobulin molecule and an antigen for which the immunoglobulin is 
specific. The strength, or affinity of immunological binding interactions can be 
5 expressed in terms of the dissociation constant (Kd) of the interaction, wherein 
a smaller Kd represents a greater affinity. Immunological binding properties of 
selected polypeptides can be quantified using methods well known in the art. 
One such method entails measuring the rates of antigen-binding site/antigen 
complex formation and dissociation, wherein those rates depend on the 

10 concentrations of the complex partners, the affinity of the interaction, and on 
geometric parameters that equally influence the rate in both directions. Thus, 
both the "on rate constant" (Kon) and the "off rate constant" (Koff) can be 
determined by calculation of the concentrations and the actual rates of 
association and dissociation. The ratio of Koff /Kon enables cancellation of all 

15 parameters not related to affinity, and is thus equal to the dissociation constant 
Kd. See, generally, Davies et al. (1990) Annual Rev. Biochem. 59:439-473. 

An "antigen-binding site," or "binding portion" of an antibody 
refers to the part of the immunoglobulin molecule that participates in antigen 
binding. The antigen binding site is formed by amino acid residues of the N- 

20 terminal variable ("V") regions of the heavy ( n H a ) and light ("L") chains. Three 
highly divergent stretches within the V regions of the heavy and light chains are 
referred to as "hypervariable regions" which are interposed between more 
conserved flanking stretches known as "framework regions," or "FRs". Thus 
the term u FR n refers to amino acid sequences which are naturally found 

25 between and adjacent to hypervariable regions in immunoglobulins. In an 
antibody molecule, the three hypervariable regions of a light chain and the 
three hypervariable regions of a heavy chain are disposed relative to each 
other in three dimensional space to form an antigen-binding surface. The 
antigen-binding surface is complementary to the three-dimensional surface of a 

30 bound antigen, and the three hypervariable regions of each of the heavy and 
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light chains are referred to as "complementarity-determining regions," or 
"CDRs." 

Binding agents may be further capable of differentiating between 
patients with and without a cancer, such as breast cancer, using the 
5 representative assays provided herein. For example, antibodies or other 
binding agents that bind to a tumor protein will preferably generate a signal 
indicating the presence of a cancer in at least about 20% of patients with the 
disease, more preferably at least about 30% of patients. Alternatively, or in 
addition, the antibody will generate a negative signal indicating the absence of 
1 0 the disease in at least about 90% of individuals without the cancer. To 
determine whether a binding agent satisfies this requirement, biological 
samples (e.g., blood, sera, sputum, urine and/or tumor biopsies) from patients 
with and without a cancer (as determined using standard clinical tests) may be 
assayed as described herein for the presence of polypeptides that bind to the 
15 binding agent. Preferably, a statistically significant number of samples with and 
without the disease will be assayed. Each binding agent should satisfy the 
above criteria; however, those of ordinary skill in the art will recognize that 
binding agents may be used in combination to improve sensitivity. 

Any agent that satisfies the above requirements may be a binding 
20 agent. For example, a binding agent may be a ribosome, with or without a 
peptide component, an RNA molecule or a polypeptide. In a preferred 
embodiment, a binding agent is an antibody or an antigen-binding fragment 
thereof. Antibodies may be prepared by any of a variety of techniques known 
to those of ordinary skill in the art. See, e.g., Harlow and Lane, Antibodies: A 
25 Laboratory Manual, Cold Spring Harbor Laboratory, 1 988. In general, 

antibodies can be produced by cell culture techniques, including the generation 
of monoclonal antibodies as described herein, or via transfection of antibody 
genes into suitable bacterial or mammalian cell hosts, in order to allow for the 
production of recombinant antibodies. In one technique, an immunogen 
30 comprising the polypeptide is initially injected into any of a wide variety of 
mammals (e.g., mice, rats, rabbits, sheep or goats). In this step, the 
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polypeptides of this invention may serve as the immunogen without 
modification. Alternatively, particularly for relatively short polypeptides, a 
superior immune response may be elicited if the polypeptide is joined to a 
carrier protein, such as bovine serum albumin or keyhole limpet hemocyanin. 
5 The immunogen is injected into the animal host, preferably according to a 
predetermined schedule incorporating one or more booster immunizations, and 
the animals are bled periodically. Polyclonal antibodies specific for the 
polypeptide may then be purified from such antisera by, for example, affinity 
chromatography using the polypeptide coupled to a suitable solid support. 

1 o Monoclonal antibodies specific for an antigenic polypeptide of 

interest may be prepared, for example, using the technique of Kohler and 
Milstein, Eur. J. Immunol. 6:51 1-519, 1976, and improvements thereto. Briefly, 
these methods involve the preparation of immortal cell lines capable of 
producing antibodies having the desired specificity (/.e., reactivity with the 

15 polypeptide of interest). Such cell lines may be produced, for example, from . 
spleen cells obtained from an animal immunized as described above. The 
spleen cells are then immortalized by, for example, fusion with a myeloma cell 
fusion partner, preferably one that is syngeneic with the immunized animal. A 
variety of fusion techniques may be employed. For example, the spleen cells 

20 and myeloma cells may be combined with a nonionic detergent for a few 

minutes and then plated at low density on a selective medium that supports the 
growth of hybrid cells, but not myeloma cells. A preferred selection technique 
uses HAT (hypoxanthine, aminopterin, thymidine) selection. After a sufficient 
time, usually about 1 to 2 weeks, colonies of hybrids are observed. Single 

25 colonies are selected and their culture supernatants tested for binding activity 
against the polypeptide. Hybridomas having high reactivity and specificity are 
preferred. 

Monoclonal antibodies may be isolated from the supernatants of 
growing hybridoma colonies. In addition, various techniques may be employed 
30 to enhance the yield, such as injection of the hybridoma cell line into the 
peritoneal cavity of a suitable vertebrate host, such as a mouse. Monoclonal 
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antibodies may then be harvested from the ascites fluid or the blood. 
Contaminants may be removed from the antibodies by conventional 
techniques, such as chromatography, gel filtration, precipitation, and extraction. 
The polypeptides of this invention may be used in the purification process in, 
5 for example, an affinity chromatography step. 

A number of therapeutically useful molecules are known in the art 
which comprise antigen-binding sites that are capable of exhibiting 
immunological binding properties of an antibody molecule. The proteolytic 
enzyme papain preferentially cleaves IgG molecules to yield several fragments, 
10 two of which (the "Ftab)' fragments) each comprise a covalent heterodimer that 
includes an intact antigen-binding site. The enzyme pepsin is able to cleave 
IgG molecules to provide several fragments, including the "F(ab')2 fragment 
which comprises both antigen-binding sites. An "Fv" fragment can be produced 
by preferential proteolytic cleavage of an IgM, and on rare occasions IgG or IgA 
1 5 immunoglobulin molecule. Fv fragments are, however, more commonly derived 
using recombinant techniques known in the art. The Fv fragment includes a 
non-covalent V H ::V L heterodimer including an antigen-binding site which retains 
much of the antigen recognition and binding capabilities of the native antibody 
molecule. Inbar et al. (1972) Proc. Nat. Acad. Sci. USA 69:2659-2662; 
20 Hochman et al. (1 976) Biochem 1 5:2706-271 0; and Ehrlich et al. (1 980) 
Biochem 19:4091-4096. 

A single chain Fv ("sFv") polypeptide is a covalentry linked V H ::V L 
heterodimer which is expressed from a gene fusion including V H - and V L - 
encoding genes linked by a peptide-encoding linker. Huston et al. (1988) Proc. 
25 Nat. Acad. Sci. USA 85(1 6):5879-5883. A number of methods have been 
described to discern chemical structures for converting the naturally 
aggregated-but chemically separated-light and heavy polypeptide chains from 
an antibody V region into an sFv molecule which will fold into a three 
dimensional structure substantially similar to the structure of an antigen-binding 
30 site. See, e.g., U.S. Pat. Nos. 5,091 ,513 and 5,132,405, to Huston et al.; and 
U.S. Pat No. 4,946,778, to Ladner et al. 
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Each of the above-described molecules includes a heavy chain 
and a light chain CDR set, respectively interposed between a heavy chain and 
a light chain FR set which provide support to the CDRS and define the spatial 
relationship of the CDRs relative to each other. As used herein, the term "CDR 
5 set" refers to the three hypervariable regions of a heavy or light chain V region. 
Proceeding from the N-terminus of a heavy or light chain, these regions are 
denoted as a CDR1, D "CDR2," and n CDR3 u respectively. An antigen-binding 
site, therefore, includes six CDRs, comprising the CDR set from each of a 
heavy and a light chain V region. A polypeptide comprising a single CDR, (e.g., 

10 a CDR1 , CDR2 or CDR3) is referred to herein as a "molecular recognition unit." 
Crystallographic analysis of a number of antigen-antibody complexes has 
demonstrated that the amino acid residues of CDRs form extensive contact 
with bound antigen, wherein the most extensive antigen contact is with the 
heavy chain CDR3. Thus, the molecular recognition units are primarily 

1 5 responsible for the specificity of an antigen-binding site. 

As used herein, the term "FR set" refers to the four flanking amino 
acid sequences which frame the CDRs of a CDR set of a heavy or light chain V 
region. Some FR residues may contact bound antigen; however, FRs are 
primarily responsible for folding the V region into the antigen-binding site, 

20 particularly the FR residues directly adjacent to the CDRS. Within FRs, certain 
amino residues and certain structural features are very highly conserved. In 
this regard, all V region sequences contain an internal disulfide loop of around 
90 amino acid residues. When the V regions fold into a binding-site, the CDRs 
are displayed as projecting loop motifs which form an antigen-binding surface. 

25 It is generally recognized that there are conserved structural regions of FRs 
which influence the folded shape of the CDR loops into certain "canonical" 
structures-regardless of the precise CDR amino acid sequence. Further, 
certain FR residues are known to participate in non-covalent interdomain 
contacts which stabilize the interaction of the antibody heavy and light chains. 

30 A number of "humanized" antibody molecules comprising an 

antigen-binding site derived from a non-human immunoglobulin have been 
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described, including chimeric antibodies having rodent V regions and their 
associated CDRs fused to human constant domains (Winter etal. (1991) 
Nature 349:293-299; Lobuglio et al. (1989) Proc. Nat. Acad. Sci. USA 86:4220- 
4224; Shaw et al. (1987) J Immunol. 138:4534-4538; and Brown et al. (1987) 
5 Cancer Res. 47:3577-3583), rodent CDRs grafted into a human supporting FR 
prior to fusion with an appropriate human antibody constant domain 
(Riechmann et al. (1988) Nature 332:323-327; Verhoeyen et al. (1988) Science 
239:1534-1536; and Jones et al. (1986) Nature 321:522-525), and rodent 
CDRs supported by recombinarrtly veneered rodent FRs (European Patent 
1 0 Publication No. 51 9,596, published Dec. 23, 1 992). These "humanized" 

molecules are designed to minimize unwanted immunological response toward 
rodent antihuman antibody molecules which limits the duration and 
effectiveness of therapeutic applications of those moieties in human recipients. 
As used herein, the terms "veneered FRs" and "recombinantJy 
1 5 veneered FRs" refer to the selective replacement of FR residues from , e.g. , a 
rodent heavy or light chain V region, with human FR residues in order to 
provide a xenogeneic molecule comprising an antigen-binding site which 
retains substantially all of the native FR polypeptide folding structure. 
Veneering techniques are based on the understanding that the ligand binding 
20 characteristics of an antigen-binding site are determined primarily by the 
structure and relative disposition of the heavy and light chain CDR sets within 
the antigen-binding surface. Davies et al. (1990) Ann. Rev. Biochem. 59:439- 
473. Thus, antigen binding specificity can be preserved in a humanized 
antibody only wherein the CDR structures, their interaction with each other, and 
25 their interaction with the rest of the V region domains are carefully maintained. 
By using veneering techniques, exterior (e.g., solvent-accessible) FR residues 
which are readily encountered by the immune system are selectively replaced 
with human residues to provide a hybrid molecule that comprises either a 
weakly immunogenic, or substantially non-immunogenic veneered surface. 
30 The process of veneering makes use of the available sequence 

data for human antibody variable domains compiled by Kabat et al., in 
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Sequences of Proteins of Immunological Interest, 4th ed., (U.S. Dept. of Health 
and Human Services, U.S. Government Printing Office, 1987), updates to the 
Kabat database, and other accessible U.S. and foreign databases (both nucleic 
acid and protein). Solvent accessibilities of V region amino acids can be 
5 deduced from the known three-dimensional structure for human and murine 
antibody fragments. There are two general steps in veneering a murine 
antigen-binding site. Initially, the FRs of the variable domains of an antibody 
molecule of interest are compared with corresponding FR sequences of human 
variable domains obtained from the above-identified sources. The most 

1 0 homologous human V regions are then compared residue by residue to 

corresponding murine amino acids. The residues in the murine FR which differ 
from the human counterpart are replaced by the residues present in the human 
moiety using recombinant techniques well known in the art. Residue switching 
is only carried out with moieties which are at least partially exposed (solvent 

1 5 accessible), and care is exercised in the replacement of amino acid residues 
which may have a significant effect on the tertiary structure of V region 
domains, such as proline, glycine and charged amino acids. 

In this manner, the resultant D veneered n murine antigen-binding 
sites are thus designed to retain the murine CDR residues, the residues 

20 substantially adjacent to the CDRs, the residues identified as buried or mostly 
buried (solvent inaccessible), the residues believed to participate in non- 
covalent (e.g., electrostatic and hydrophobic) contacts between heavy and light 
chain domains, and the residues from conserved structural regions of the FRs 
which are believed to influence the "canonical" tertiary structures of the CDR 

25 loops. These design criteria are then used to prepare recombinant nucleotide 
sequences which combine the CDRs of both the heavy and light chain of a 
murine antigen-binding site into human-appearing FRs that can be used to 
transfect mammalian cells for the expression of recombinant human antibodies 
which exhibit the antigen specificity of the murine antibody molecule. 

30 In another embodiment of the invention, monoclonal antibodies of 

the present invention may be coupled to one or more therapeutic agents. 
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Suitable agents in this regard include radionuclides, differentiation inducers, 
drugs, toxins, and derivatives thereof. Preferred radionuclides include "°Y, 123 l, 
125| 131 1 i^Re, 188 Re, 211 At, and 212 Bi. p re f erre d drugs include methotrexate, 
and pyrimidine and purine analogs. Preferred differentiation inducers include 
5 phorbol esters and butyric acid. Preferred toxins include ricin, abrin, diptheria 
toxin, cholera toxin, gelonin, Pseudomonas exotoxin, Shigella toxin, and 

pokeweed antiviral protein. 

A therapeutic agent may be coupled (e.g., covalently bonded) to a 
suitable monoclonal antibody either directly or indirectly (e.g., via a linker 
1 0 group). A direct reaction between an agent and an antibody is possible when 
each possesses a substituent capable of reacting with the other. For example, 
a nucleophilic group, such as an amino or sulfhydryl group, on one may be 
capable of reacting with a carbonyl-containing group, such as an anhydride or 
an acid halide, or with an alkyl group containing a good leaving group (e.g., a 
15 halide) on the other. 

Alternatively, it may be desirable to couple a therapeutic agent 
and an antibody via a linker group. A linker group can function as a spacer to 
distance an antibody from an agent in order to avoid interference with binding 
capabilities. A linker group can also serve to increase the chemical reactivity of 
20 a substituent on an agent or an antibody, and thus increase the coupling 
efficiency. An increase in chemical reactivity may also facilitate the use of 
agents, or functional groups on agents, which otherwise would not be possible. 

It will be evident to those skilled in the art that a variety of 
bifunctional or polyfunctional reagents, both homo- and hetero-functional (such 
25 as those described in the catalog of the Pierce Chemical Co., Rockford, IL), 
may be employed as the linker group. Coupling may be effected, for example, 
through amino groups, carboxyl groups, sulfhydryl groups or oxidized 
carbohydrate residues. There are numerous references describing such 
methodology, e.g., U.S. Patent No. 4,671,958, to Rodwell et al. 
30 Where a therapeutic agent is more potent when free from the 

antibody portion of the immunoconjugates of the present invention, it may be 

66 



W O03013431 [h tlp://www. q etthe p atem^ of 308 

WO 03/013431 PCT/US02/24917 

desirable to use a linker group which is cleavable during or upon internalization 
into a cell. A number of different cleavable linker groups have been described. 
The mechanisms for the intracellular release of an agent from these linker 
groups include cleavage by reduction of a disulfide bond (e.g., U.S. Patent 
5 No. 4,489,710, to Spitler), by irradiation of a photolabile bond (e.g., U.S. Patent 
No. 4,625,014, to Senter et al.), by hydrolysis of derivatized amino acid side 
chains (e.g., U.S. Patent No. 4,638,045, to Kohn et al.), by serum complement- 
mediated hydrolysis (e.g., U.S. Patent No. 4,671,958, to Rodwell et al.), and 
acid-catalyzed hydrolysis (e.g., U.S. Patent No. 4,569,789, to Blattler et al.). 

10 It may be desirable to couple more than one agent to an antibody. 

In one embodiment, multiple molecules of an agent are coupled to one 
antibody molecule. In another embodiment, more than one type of agent may 
be coupled to one antibody. Regardless of the particular embodiment, 
immunoconjugates with more than one agent may be prepared in a variety of 

15 ways. For example, more than one agent may be coupled directly to an 

antibody molecule, or linkers that provide multiple sites for attachment can be 
used. Alternatively, a carrier can be used. 

A earner may bear the agents in a variety of ways, including 
covalent bonding either directly or via a linker group. Suitable carriers include 

20 proteins such as albumins (e.g., U.S. Patent No. 4,507,234, to Kato et al.), 
peptides and polysaccharides such as aminodextran (e.g., U.S. Patent No. 
4,699,784, to Shih et al.). A carrier may also bear an agent by noncovalent 
bonding or by encapsulation, such as within a liposome vesicle (e.g., U.S. 
Patent Nos. 4,429,008 and 4,873,088). Carriers specific for radionuclide 

25 agents include radiohalogenated small molecules and chelating compounds. 
For example, U.S. Patent No. 4,735,792 discloses representative 
radiohalogenated small molecules and their synthesis. A radionuclide chelate 
may be formed from chelating compounds that include those containing 
nitrogen and sulfur atoms as the donor atoms for binding the metal, or metal 

30 oxide, radionuclide. For example, U.S. Patent No. 4,673,562, to Davison et al. 
discloses representative chelating compounds and their synthesis. 
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T Cell Compositions 

The present invention, in another aspect, provides T cells specific 
for a tumor polypeptide disclosed herein, or for a variant or derivative thereof. 
Such cells may generally be prepared in vitro or ex vivo, using standard 
5 procedures. For example, T cells may be isolated from bone marrow, 

peripheral blood, or a fraction of bone marrow or peripheral blood of a patient, 
using a commercially available cell separation system, such as the Isolex™ 
System, available from Nexell Therapeutics, Inc. (Irvine, CA; see also U.S. 
Patent No. 5,240,856; U.S. Patent No. 5,215,926; WO 89/06280; WO 
10 91 /1 61 1 6 and WO 92/07243) . Alternatively, T cells may be derived from 
related or unrelated humans, non-human mammals, cell lines or cultures. 

T cells may be stimulated with a polypeptide, polynucleotide 
encoding a polypeptide and/or an antigen presenting cell (APC) that expresses 
such a polypeptide. Such stimulation is performed under conditions and for a 
1 5 time sufficient to permit the generation of T cells that are specific for the 

polypeptide of interest. Preferably, a tumor polypeptide or polynucleotide of the 
invention is present within a delivery vehicle, such as a microsphere, to 
facilitate the generation of specific T cells. 

T cells are considered to be specific for a polypeptide of the 
20 present invention if the T cells specifically proliferate, secrete cytokines or kill 
target cells coated with the polypeptide or expressing a gene encoding the 
polypeptide. T cell specificity may be evaluated using any of a variety of 
standard techniques. For example, within a chromium release assay or 
proliferation assay, a stimulation index of more than two fold increase in lysis 
25 and/or proliferation, compared to negative controls, indicates T cell specificity. 
Such assays may be performed, for example, as described in Chen et al., 
Cancer Res. 54:1065-1070, 1994. Alternatively, detection of the proliferation of 
T cells may be accomplished by a variety of known techniques. For example, 
T cell proliferation can be detected by measuring an increased rate of DNA 
30 synthesis (e.g., by pulse-labeling cultures of T cells with tfrtiated thymidine and 
measuring the amount of tritiated thymidine incorporated into DNA). Contact 
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with a tumor polypeptide (100 ng/ml - 100 [ig/ml, preferably 200 ng/ml - 25 
fxg/ml) for 3 - 7 days will typically result in at least a two fold increase in 
proliferation of the T cells. Contact as described above for 2-3 hours should 
result in activation of the T cells, as measured using standard cytokine assays 
5 in which a two fold increase in the level of cytokine release (e.g., TNF or IFN^y) 
is indicative of T cell activation (see Coligan et al., Current Protocols in 
Immunology, vol. 1, Wiley Interscience (Greene 1998)). T cells that have been 
activated in response to a tumor polypeptide, polynucleotide or polypeptide- 
expressing APC may be CD4 + and/or CD8 + . Tumor polypeptide-specif ic T cells 

1 0 may be expanded using standard techniques. Within preferred embodiments, 
the T cells are derived from a patient, a related donor or an unrelated donor, 
and are administered to the patient following stimulation and expansion. 

For therapeutic purposes, CD4+ or CD8+ T cells that proliferate in 
response to a tumor polypeptide, polynucleotide or APC can be expanded in 

1 5 number either in vitro or in vivo. Proliferation of such T cells in vitro may be 
accomplished in a variety of ways. For example, the T cells can be re-exposed 
to a tumor polypeptide, or a short peptide corresponding to an immunogenic 
portion of such a polypeptide, with or without the addition of T cell growth 
factors, such as interleukin-2, and/or stimulator cells that synthesize a tumor 

20 polypeptide. Alternatively, one or more T cells that proliferate in the presence 
of the tumor polypeptide can be expanded in number by cloning. Methods for 
cloning cells are well known in the art, and include limiting dilution. 

T Cell Receptor Compositions 

The T cell receptor (TCR) consists of 2 different, highly variable 

25 polypeptide chains, termed the T-cell receptor a and p chains, that are linked 
by a disulfide bond (Janeway, Travers, Walport. Immunobiology. Fourth Ed., 
148-159, Elsevier Science Ltd/Garland Publishing. 1999). The a/p heterodimer 
complexes with the invariant CD3 chains at the cell membrane. This complex 
recognizes specific antigenic peptides bound to MHC molecules. The 

30 enormous diversity of TCR specificities is generated much like immunoglobulin 
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diversity, through somatic gene rearrangement. The p chain genes contain 
over 50 variable (V), 2 diversity (D), over 10 joining (J) segments, and 2 
constant region segments (C). The a chain genes contain over 70 V segments, 
and over 60 J segments but no D segments, as well as one C segment. During 
5 T cell development in the thymus, the D to J gene rearrangement of the p chain 
occurs, followed by the V gene segment rearrangement to the DJ. This 
functional VDJp exon is transcribed and spliced to join to a Cp. For the a 
chain, a Va gene segment rearranges to a Ja gene segment to create the 
functional exon that is then transcribed and spliced to the Ca. Diversity is 
1 0 further increased during the recombination process by the random addition of P 
and N-nucleotides between the V, D, and J segments of the p chain and 

between the V and J segments in the □ chain (Janeway, Travers, Walport. 

Immunobiology. Fourth Ed., 98 and 150, Elsevier Science Ltd/Garland 

Publishing. 1999). 

! 5 The present invention, in another aspect, provides TCRs specific 

for a polypeptide disclosed herein, or for a variant or derivative thereof. In 
accordance with the present invention, polynucleotide and amino acid 
sequences are provided for the V-J or V-D-J junctional regions or parts thereof 
for the alpha and beta chains of the T-cell receptor which recognize tumor 

20 polypeptides described herein. In general, this aspect of the invention relates 
to T-cell receptors which recognize or bind tumor polypeptides presented in the 
context of MHC. In a preferred embodiment the tumor antigens recognized by 
the T-cell receptors comprise a polypeptide of the present invention. For 
example, cDNA encoding a TCR specific for a breast tumor peptide can be 

25 isolated from T cells specific for a tumor polypeptide using standard molecular 
biological and recombinant DNA techniques. 

This invention further includes the T-cell receptors or analogs 
thereof having substantially the same function or activity as the T-cell receptors 
of this invention which recognize or bind tumor polypeptides. Such receptors 

30 include, but are not limited to, a fragment of the receptor, or a substitution, 
addition or deletion mutant of a T-cell receptor provided herein. This invention 
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also encompasses polypeptides or peptides that are substantially homologous 
to the T-cell receptors provided herein or that retain substantially the same 
activity. The term "analog" includes any protein or polypeptide having an amino 
acid residue sequence substantially identical to the T-cell receptors provided 
5 herein in which one or more residues, preferably no more than 5 residues, 
more preferably no more than 25 residues have been conservatively 
substituted with a functionally similar residue and which displays the functional 
aspects of the T-cell receptor as described herein. 

The present invention further provides for suitable mammalian 

1 0 host cells, for example, non-specific T cells, that are transfected with a 
polynucleotide encoding TCRs specific for a polypeptide described herein, 
thereby rendering the host cell specific for the polypeptide. The a and p chains 
of the TCR may be contained on separate expression vectors or alternatively, 
on a single expression vector that also contains an internal ribosome entry site 

15 (IRES) for cap-independent translation of the gene downstream of the IRES. 
Said host cells expressing TCRs specific for the polypeptide may be used, for 
example, for adoptive immunotherapy of breast cancer as discussed further 
below. 

In further aspects of the present invention, cloned TCRs specific 
20 for a polypeptide recited herein may be used in a kit for the diagnosis of breast 
cancer. For example, the nucleic acid sequence or portions thereof, of tumor- 
specific TCRs can be used as probes or primers for the detection of expression 
of the rearranged genes encoding the specific TCR in a biological sample. 
Therefore, the present invention further provides for an assay for detecting 
25 messenger RNA or DNA encoding the TCR specific for a polypeptide. 

Pharmaceutical Compositions 

In additional embodiments, the present invention concerns 
formulation of one or more of the polynucleotide, polypeptide, T-cell, TCR, 
and/or antibody compositions disclosed herein in pharmaceutically-acceptable 
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carriers for administration to a cell or an animal, either alone, or in combination 
with one or more other modalities of therapy. 

It will be understood that, if desired, a composition as disclosed 
herein may be administered in combination with other agents as well, such as, 

5 e.g., other proteins or polypeptides or various pharmaceutically-active agents. 
In fact, there is virtually no limit to other components that may also be included, 
given that the additional agents do not cause a significant adverse effect upon 
contact with the target cells or host tissues. The compositions may thus be 
delivered along with various other agents as required in the particular instance. 

1 0 Such compositions may be purified from host cells or other biological sources, 
or alternatively may be chemically synthesized as described herein. Likewise, 
such compositions may further comprise substituted or derivatized RNA or DNA 
compositions. 

Therefore, in another aspect of the present invention, 

1 5 pharmaceutical compositions are provided comprising one or more of the 
polynucleotide, polypeptide, antibody, TCR, and/or T-cell compositions 
described herein in combination with a physiologically acceptable carrier. In 
certain preferred embodiments, the pharmaceutical compositions of the 
invention comprise immunogenic polynucleotide and/or polypeptide 

20 compositions of the invention for use in prophylactic and theraputic vaccine 
applications. Vaccine preparation is generally described in, for example, M.F. 
Powell and M J. Newman, eds., "Vaccine Design (the subunit and adjuvant 
approach)," Plenum Press (NY, 1995). Generally, such compositions will 
comprise one or more polynucleotide and/or polypeptide compositions of the 

25 present invention in combination with one or more immunostimulants. 

It will be apparent that any of the pharmaceutical compositions 
described herein can contain pharmaceutical^ acceptable salts of the 
polynucleotides and polypeptides of the invention. Such salts can be prepared, 
for example, from pharmaceutical^ acceptable non-toxic bases, including 

30 organic bases (e.g., salts of primary, secondary and tertiary amines and basic 
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amino acids) and inorganic bases (e.g., sodium, potassium, lithium, 
ammonium, calcium and magnesium salts). 

In another embodiment, illustrative immunogenic compositions, 
e.g., vaccine compositions, of the present invention comprise DNA encoding 
5 one or more of the polypeptides as described above, such that the polypeptide 
is generated in situ. As noted above, the polynucleotide may be administered 
within any of a variety of delivery systems known to those of ordinary skill in the 
art. Indeed, numerous gene delivery techniques are well known in the art, such 
as those described by Rolland, Crit Rev. Therap. Drug Carrier Systems 

10 75:143-198, 1998, and references cited therein. Appropriate polynucleotide 
expression systems will, of course, contain the necessary regulatory DNA 
regulatory sequences for expression in a patient (such as a suitable promoter 
and terminating signal). Alternatively, bacterial delivery systems may involve 
the administration of a bacterium (such as Bacillus-Calmette-Guerrin) that 

1 5 expresses an immunogenic portion of the polypeptide on its cell surface or 
secretes such an epitope. 

Therefore, in certain embodiments, polynucleotides encoding 
immunogenic polypeptides described herein are introduced into suitable 
mammalian host cells for expression using any of a number of known viral- 

20 based systems. In one illustrative embodiment, retroviruses provide a 
convenient and effective platform for gene delivery systems. A selected 
nucleotide sequence encoding a polypeptide of the present invention can be 
inserted into a vector and packaged in retroviral particles using techniques 
known in the art. The recombinant vims can then be isolated and delivered to 

25 a subject. A number of illustrative retroviral systems have been described 

(e.g., U.S. Pat. No. 5,219,740; Miller and Rosman (1989) BioTechniques 7:980- 
990; Miller, A. D. (1990) Human Gene Therapy 1:5-14; Scarpa et al. (1991) 
Virology 180:849-852; Bums et al. (1993) Proc. Natl. Acad. Sci. USA 90:8033- 
8037; and Boris-Lawrie and Temin (1993) Cur. Opin. Genet. Develop. 3:102- 

30 109. 
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In addition, a number of illustrative adenovirus-based systems 
have also been described. Unlike retroviruses which integrate into the host 
genome, adenoviruses persist extrachromosomally thus minimizing the risks 
associated with insertional mutagenesis (Haj-Ahmad and Graham (1986) J. 
5 Virol. 57:267-274; Bett et al. (1 993) J. Virol. 67:591 1 -5921 ; Mittereder et al. 
(1994) Human Gene Therapy 5:717-729; Seth etal. (1994) J. Virol. 68:933- 
940; Barr et al. (1994) Gene Therapy 1 :51-58; Berkner, K. L (1988) 
BioTechniques 6:616-629; and Rich et al. (1993) Human Gene Therapy 4:461- 
476). 

1 o Various adeno-associated virus (AAV) vector systems have also 

been developed for polynucleotide delivery. AAV vectors can be readily 
constructed using techniques well known in the art. See, e.g., U.S. Pat. Nos. 
5,173,414 and 5,139,941; International Publication Nos. WO 92/01070 and 
WO 93/03769; Lebkowski et al. (1 988) Molec. Cell. Biol. 8:3988-3996; Vincent 
15 et al. (1 990) Vaccines 90 (Cold Spring Harbor Laboratory Press); Carter, B. J. 
(1992) Current Opinion in Biotechnology 3:533-539; Muzyczka, N. (1992) 
Current Topics in Microbiol, and Immunol. 158:97-129; Kotin, R. M. (1994) 
Human Gene Therapy 5:793-801; Shelling and Smith (1994) Gene Therapy 
1:165-169; and Zhou etal. (1994) J. Exp. Med. 179:1867-1875. 
20 Additional viral vectors useful for delivering the polynucleotides 

encoding polypeptides of the present invention by gene transfer include those 
derived from the pox family of viruses, such as vaccinia virus and avian 
poxvirus. By way of example, vaccinia virus recombinants expressing the novel 
molecules can be constructed as follows. The DNA encoding a polypeptide is 
25 first inserted into an appropriate vector so that it is adjacent to a vaccinia 
promoter and flanking vaccinia DNA sequences, such as the sequence 
encoding thymidine kinase (TK). This vector is then used to transfect cells 
which are simultaneously infected with vaccinia. Homologous recombination 
serves to insert the vaccinia promoter plus the gene encoding the polypeptide 
30 of interest into the viral genome. The resulting TK.sup. (-) recombinant can be 
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selected by cutturing the cells in the presence of 5-bromodeoxyuridine and 
picking viral plaques resistant thereto. 

A vaccinia-based infectionAransfection system can be 
conveniently used to provide for inducible, transient expression or coexpression 
5 of one or more polypeptides described herein in host cells of an organism. In 
this particular system, cells are first infected in vitro with a vaccinia virus 
recombinant that encodes the bacteriophage T7 RNA polymerase. This 
polymerase displays exquisite specificity in that it only transcribes templates 
bearing T7 promoters. Following infection, cells are transfected with the 

10 polynucleotide or polynucleotides of interest, driven by a T7 promoter. The 
polymerase expressed in the cytoplasm from the vaccinia virus recombinant 
transcribes the transfected DNA into RNA which is then translated into 
polypeptide by the host translational machinery. The method provides for high 
level, transient, cytoplasmic production of large quantities of RNA and its 

15 translation products. See, e.g., Elroy-Stein and Moss, Proc. Natl. Acad. Sci. 
USA (1990) 87:6743-6747; Fuerst et al. Proc. Natl. Acad. Sci. USA (1986) 
83:8122-8126. 

Alternatively, avipoxviruses, such as the fowlpox and canarypox 
viruses, can also be used to deliver the coding sequences of interest. 

20 Recombinant avipox viruses, expressing immunogens from mammalian 

pathogens, are known to confer protective immunity when administered to non- 
avian species. The use of an Avipox vector is particularly desirable in human 
and other mammalian species since members of the Avipox genus can only 
productively replicate in susceptible avian species and therefore are not 

25 infective in mammalian cells. Methods for producing recombinant 

Avipoxviruses are known in the art and employ genetic recombination, as 
described above with respect to the production of vaccinia viruses. See, e.g., 
WO 91/12882; WO 89/03429; and WO 92/03545. 

Any of a number of alphavirus vectors can also be used for 

30 delivery of polynucleotide compositions of the present invention, such as those 
vectors described in U.S. Patent Nos. 5,843,723; 6,015,686; 6,008,035 and 
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6,015,694. Certain vectors based on Venezuelan Equine Encephalitis (VEE) 
can also be used, illustrative examples of which can be found in U.S. Patent 
Nos. 5,505,947 and 5,643,576. 

Moreover, molecular conjugate vectors, such as the adenovirus 

5 chimeric vectors described in Michael et al. J. Biol. Chem. (1 993) 268:6866- 
6869 and Wagner et al. Proc. Natl. Acad. Sci. USA (1992) 89:6099-6103, can 
also be used for gene delivery under the invention. 

Additional illustrative information on these and other known viral- 
based delivery systems can be found, for example, in Psher-Hoch et al., Proc. 

1 0 Natl. Acad. Sci. USA 86:31 7-321 , 1 989; Flexner et al., Ann. N. Y. Acad. Sci. 
569:86-103, 1989; Flexner et al., Vaccine 8:17-21, 1990; U.S. Patent 
Nos. 4,603,1 12, 4,769,330, and 5,017,487; WO 89/01973; U.S. Patent 
No. 4,777,127; GB 2,200,651 ; EP 0,345,242; WO 91/02805; Berkner, 
Biotechniques 6:616-627, 1988; Rosenfeld et al., Science 252:431-434, 1991; 

1 5 Kolls et al., Proc. Natl. Acad. Sci. USA 97:21 5-21 9, 1 994; Kass-Eisler et al., 
Proc. Natl. Acad. Sci. USA 90:1 1 498-1 1502, 1993; Guzman et al., Circulation 
88.2838-2848, 1993; and Guzman et al., Cir. Res. 73:1202-1207, 1993. 

In certain embodiments, a polynucleotide may be integrated into 
the genome of a target cell. This integration may be in the specific location and 

20 orientation via homologous recombination (gene replacement) or it may be 
integrated in a random, non-specific location (gene augmentation). In yet 
further embodiments, the polynucleotide may be stably maintained in the cell 
as a separate, episomal segment of DNA. Such polynucleotide segments or 
"episomes" encode sequences sufficient to permit maintenance and replication 

25 independent of or in synchronization with the host ceil cycle. The manner in 
which the expression construct is delivered to a cell and where in the cell the 
polynucleotide remains is dependent on the type of expression construct 
employed. 

In another embodiment of the invention, a polynucleotide is 
30 administered/delivered as "naked" DNA, for example as described in Ulmer et 
al., Science 259:1745-1749, 1993 and reviewed by Cohen, Science 259:1691- 
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1 692, 1 993. The uptake of naked DNA may be increased by coating the DNA 
onto biodegradable beads, which are efficiently transported into the cells. 

In still another embodiment, a composition of the present 
invention can be delivered via a particle bombardment approach, many of 
5 which have been described. In one illustrative example, gas-driven particle 
acceleration can be achieved with devices such as those manufactured by 
Powderject Pharmaceuticals PLC (Oxford, UK) and Powderject Vaccines Inc. 
(Madison, Wl), some examples of which are described in U.S. Patent Nos. 
5,846,796; 6,010,478; 5,865,796; 5,584,807; and EP Patent No. 0500 799. 
1 0 This approach offers a needle-free delivery approach wherein a dry powder 
formulation of microscopic particles, such as polynucleotide or polypeptide 
particles, are accelerated to high speed within a helium gas jet generated by a 
hand held device, propelling the particles into a target tissue of interest. 

In a related embodiment, other devices and methods that may be 
1 5 useful for gas-driven needle-less injection of compositions of the present 
invention include those provided by Bioject, Inc. (Portland, OR), some 
examples of which are described in U.S. Patent Nos. 4,790,824; 5,064,413; 
5,312,335; 5,383,851; 5,399,163; 5,520,639 and 5,993,412. 

According to another embodiment, the pharmaceutical 
20 compositions described herein will comprise one or more immunostimulants in 
addition to the immunogenic polynucleotide, polypeptide, antibody, T-cell, TCR, 
and/or APC compositions of this invention. An immunostimulant refers to 
essentially any substance that enhances or potentiates an immune response 
(antibody and/or cell-mediated) to an exogenous antigen. One preferred type 
25 of immunostimulant comprises an adjuvant. Many adjuvants contain a 
substance designed to protect the antigen from rapid catabolism, such as 
aluminum hydroxide or mineral oil, and a stimulator of immune responses, such 
as lipid A, Bortadella pertussis or Mycobacterium tuberculosis derived proteins. 
Certain adjuvants are commercially available as, for example, Freund's 
30 Incomplete Adjuvant and Complete Adjuvant (Difco Laboratories, Detroit, Ml); 
Merck Adjuvant 65 (Merck and Company, Inc., Rahway, NJ); AS-2 (SmithKline 
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Beecham, Philadelphia, PA); aluminum salts such as aluminum hydroxide gel 
(alum) or aluminum phosphate; salts of calcium, iron or zinc; an insoluble 
suspension of acylated tyrosine; acylated sugars; cationically or anionically 
derivatized polysaccharides; polyphosphazenes; biodegradable microspheres; 

5 monophosphoryl lipid A and quil A. Cytokines, such as GM-CSF, interleukin-2, 
-7, -12, and other like growth factors, may also be used as adjuvants. 

Within certain embodiments of the invention, the adjuvant 
composition is preferably one that induces an immune response predominantly 
of the Th1 type. High levels of Th1 -type cytokines (e.g., IFN-y, TNFct, IL-2 and 

1 0 IL-12) tend to favor the induction of cell mediated immune responses to an 
administered antigen. In contrast, high levels of Th2-type cytokines (e.g., IL-4, 
IL-5, IL-6 and IL-1 0) tend to favor the induction of humoral immune responses. 
Following application of a vaccine as provided herein, a patient will support an 
immune response that includes Th1- and Th2-type responses. Within a 

1 5 preferred embodiment, in which a response is predominantly Thl -type, the 
level of Th1 -type cytokines will increase to a greater extent than the level of 
Th2-type cytokines. The levels of these cytokines may be readily assessed 
using standard assays. For a review of the families of cytokines, see Mosmann 
andCoffman, Ann. Rev. Immunol. 7:145-173, 1989. 

20 Certain preferred adjuvants for eliciting a predominantly Thl -type 

response include, for example, a combination of monophosphoryl lipid A, 
preferably 3-de-O-acylated monophosphoryl lipid A, together with an aluminum 
salt MPL® adjuvants are available from Corixa Corporation (Seattle, WA; see, 
for example, US Patent Nos. 4,436,727; 4,877,61 1 ; 4,866,034 and 4,912,094). 

25 CpG-containing oligonucleotides (in which the CpG dinucleotide is 
unmethylated) also induce a predominantly Th1 response. Such 
oligonucleotides are well known and are described, for example, in WO 
96/02555, WO 99/33488 and U.S. Patent Nos. 6,008,200 and 5,856,462. 
Immunostimulatory DNA sequences are also described, for example, by Sato 

30 et ah, Science 273:352, 1 996. Another preferred adjuvant comprises a 
saponin, such as Quil A, or derivatives thereof, including QS21 and QS7 
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(Aquila Biopharmaceuticals Inc., Framingham, MA); Escin; Digitonin; or 
Gypsophila or Chenopodium quinoa saponins. Other preferred formulations 
include more than one saponin in the adjuvant combinations of the present 
invention, for example combinations of at least two of the following group 
5 comprising QS21 , QS7, Quil A, p-escin, or digitonin. 

Alternatively the saponin formulations may be combined with 
vaccine vehicles composed of chitosan or other polycationic polymers, 
polylactide and polylactide-co-glycolide particles, poly-N-acetyl glucosamine- 
based polymer matrix, particles composed of polysaccharides or chemically 
10 modified polysaccharides, liposomes and lipid-based particles, particles 

composed of glycerol monoesters, etc. The saponins may also be formulated 
in the presence of cholesterol to form particulate structures such as liposomes 
or ISCOMs. Furthermore, the saponins may be formulated together with a 
polyoxyethylene ether or ester, in either a non-particulate solution or 

1 5 suspension, or in a particulate structure such as a paucilamelar liposome or 
ISCOM. The saponins may also be formulated with excipients such as 
Carbopol R to increase viscosity, or may be formulated in a dry powder form with 
a powder excipient such as lactose. 

In one preferred embodiment, the adjuvant system includes the 

20 combination of a monophosphoryl lipid A and a saponin derivative, such as the 
combination of QS21 and 3D-MPL® adjuvant, as described in WO 94/00153, or 
a less reactogenic composition where the QS21 is quenched with cholesterol, 
as described in WO 96/33739. Other preferred formulations comprise an oil-in- 
water emulsion and tocopherol. Another particularly preferred adjuvant 

25 formulation employing QS21 , 3D-MPL® adjuvant and tocopherol in an oil-in- 
water emulsion is described in WO 95/1 721 0. 

Another enhanced adjuvant system involves the combination of a 
CpG-containing oligonucleotide and a saponin derivative particularly the 
combination of CpG and QS21 is disclosed in WO 00/09159. Preferably the 

30 formulation additionally comprises an oil in water emulsion and tocopherol. 
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Additional illustrative adjuvants for use in the pharmaceutical 
compositions of the invention include Montanide ISA 720 (Seppic, France), 
SAF (Chiron, California, United States), ISCOMS (CSL), MF-59 (Chiron), the 
SBAS series of adjuvants (e.g., SBAS-2 or SBAS-4, available from SmithWine 

5 Beecham, Rixensart, Belgium), Detox (Enhanzyn®) (Corixa, Hamilton, MT), RC- 
529 (Corixa, Hamilton, MT) and other aminoalkyl glucosaminide 4-phosphates 
(AGPs), such as those described in pending U.S. Patent Application Serial 
Nos. 08/853,826 and 09/074,720, the disclosures of which are incorporated 
herein by reference in their entireties, and polyoxyethylene ether adjuvants 

1 0 such as those described in WO 99/52549A1 . 

Other preferred adjuvants include adjuvant molecules of the 

general formula 

(I): HO(CH 2 CH20)„-A-R, 

wherein, n is 1-50, A is a bond or-C(0)-, R is C1-50 alkyl or 

15 Phenyl Ci-50 alkyl. 

One embodiment of the present invention consists of a vaccine 
formulation comprising a polyoxyethylene ether of general formula (I), wherein 
n is between 1 and 50, preferably 4-24, most preferably 9; the R component is 
Ci-5o, preferably C4-C20 alkyl and most preferably C12 alkyl, and A is a bond. 

20 The concentration of the polyoxyethylene ethers should be in the range 0.1 - 
20%, preferably from 0.1-1 0%, and most preferably in the range 0.1 -1 %. 
Preferred polyoxyethylene ethers are selected from the following group: 
polyoxyethylene-9-lauryl ether, polyoxyethylene-9-steoryl ether, 
polyoxyethylene-8-steoryl ether, polyoxyethylene-4-lauryl ether, 

25 polyoxyethylene-35-lauryl ether, and polyoxyethylene-23-lauryl ether. 

Polyoxyethylene ethers such as polyoxyethylene lauryl ether are described in 
the Merck index (12 th edition: entry 7717). These adjuvant molecules are 
described in WO 99/52549. 

The polyoxyethylene ether according to the general formula (I) 

30 above may, if desired, be combined with another adjuvant. For example, a 
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preferred adjuvant combination is preferably with CpG as described in the 
pending UK patent application GB 9820956.2. 

According to another embodiment of this invention, an 
immunogenic composition described herein is delivered to a host via antigen 
5 presenting cells (APCs), such as dendritic cells, macrophages, B cells, 

monocytes and other cells that may be engineered to be efficient APCs. Such 
cells may, but need not, be genetically modified to increase the capacity for 
presenting the antigen, to improve activation and/or maintenance of the T cell 
response, to have anti-tumor effects perse and/or to be immunologically 

1 0 compatible with the receiver (i.e., matched HLA haplotype). APCs may 
generally be isolated from any of a variety of biological fluids and organs, 
including tumor and peritumoral tissues, and may be autologous, allogeneic, 
syngeneic or xenogeneic cells. 

Certain preferred embodiments of the present invention use 

1 5 dendritic cells or progenitors thereof as antigen-presenting cells. Dendritic cells 
are highly potent APCs (Banchereau and Steinman, Nature 392:245-251 , 
1998) and have been shown to be effective as a physiological adjuvant for 
eliciting prophylactic or therapeutic antitumor immunity (see Timmerman and 
Levy, Ann. Rev. Med. 50:507-529, 1999). In general, dendritic cells may be 

20 identified based on their typical shape (stellate in situ, with marked cytoplasmic 
processes (dendrites) visible in vitro), their ability to take up, process and 
present antigens with high efficiency and their ability to activate naive T cell 
responses. Dendritic cells may, of course, be engineered to express specific 
cell-surface receptors or ligands that are not commonly found on dendritic cells 

25 in vivo or ex vivo, and such modified dendritic cells are contemplated by the 
present invention. As an alternative to dendritic cells, secreted vesicles 
antigen-loaded dendritic cells (called exosomes) may be used within a vaccine 
(see Zrtvogel et al., Nature Med 4:594-600, 1998). 

Dendritic cells and progenitors may be obtained from peripheral 

30 blood, bone marrow, tumor-infiltrating cells, peritumoral tissues-infiltrating cells, 
lymph nodes, spleen, skin, umbilical cord blood or any other suitable tissue or 
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fluid. For example, dendritic cells may be differentiated ex vivo by adding a 
combination of cytokines such as GM-CSF, IL-4, IL-13 and/or TNFa to cultures 
of monocytes harvested from peripheral blood. Alternatively, CD34 positive 
cells harvested from peripheral blood, umbilical cord blood or bone marrow 
5 may be differentiated into dendritic cells by adding to the culture medium 
combinations of GM-CSF, IL-3, TNFa, CD40 ligand, LPS, flt3 ligand and/or 
other compound(s) that induce differentiation, maturation and proliferation of 
dendritic cells. 

Dendritic cells are conveniently categorized as "immature" and 

1 0 "mature" cells, which allows a simple way to discriminate between two well 
characterized phenotypes. However, this nomenclature should not be 
construed to exclude all possible intermediate stages of differentiation. 
Immature dendritic cells are characterized as APC with a high capacity for 
antigen uptake and processing, which correlates with the high expression of 

15 Fey receptor and mannose receptor. The mature phenotype is typically 

characterized by a lower expression of these markers, but a high expression of 
cell surface molecules responsible for T cell activation such as class I and class 
II MHC, adhesion molecules (e.g., CD54 and CD11) and costimulatory 
molecules (e.g., CD40, CD80, CD86 and 4-1 BB). 

20 APCs may generally be transfected with a polynucleotide of the 

invention (or portion or other variant thereof) such that the encoded 
polypeptide, or an immunogenic portion thereof, is expressed on the cell 
surface. Such transection may take place ex vivo, and a pharmaceutical 
composition comprising such transfected cells may then be used for 

25 therapeutic purposes, as described herein. Alternatively, a gene delivery 
vehicle that targets a dendritic or other antigen presenting cell may be 
administered to a patient, resulting in transection that occurs in vivo. In vivo 
and ex vivo transection of dendritic cells, for example, may generally be 
performed using any methods known in the art, such as those described in WO 

30 97/24447, or the gene gun approach described by Mahvi et al., Immunology 
and cell Biology 75:456-460, 1997. Antigen loading of dendritic cells may be 
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achieved by incubating dendritic cells or progenitor cells with the tumor 
polypeptide, DNA (naked or within a plasmid vector) or RNA; or with antigen- 
expressing recombinant bacterium or viruses (e.g., vaccinia, fowlpox, 
adenovirus or lentivirus vectors). Prior to loading, the polypeptide may be 
5 covalently conjugated to an immunological partner that provides T cell help 
(e.g., a carrier molecule). Alternatively, a dendritic cell may be pulsed with a 
non-conjugated immunological partner, separately or in the presence of the 
polypeptide. 

While any suitable carrier known to those of ordinary skill in the 

1 0 art may be employed in the pharmaceutical compositions of this invention, the 
type of carrier will typically vary depending on the mode of administration. 
Compositions of the present invention may be formulated for any appropriate 
manner of administration, including for example, topical, oral, nasal, mucosal, 
intravenous, intracranial, intraperitoneal, subcutaneous and intramuscular 

15 administration. 

Carriers for use within such pharmaceutical compositions are 
biocompatible, and may also be biodegradable. In certain embodiments, the 
formulation preferably provides a relatively constant level of active component 
release. In other embodiments, however, a more rapid rate of release 

20 immediately upon administration may be desired. The formulation of such 
compositions is well within the level of ordinary skill in the art using known 
techniques. Illustrative carriers useful in this regard include microparticles of 
poly(lactide-co-glycolide), polyacrylate, latex, starch, cellulose, dextran and the 
like. Other illustrative delayed-release carriers include supramolecular 

25 biovectors, which comprise a non-liquid hydrophilic core (e.g., a cross-linked 
polysaccharide or oligosaccharide) and, optionally, an external layer comprising 
an amphophilic compound, such as a phospholipid (see e.g., U.S. Patent No. 
5,151 ,254 and PCT applications WO 94/20078, WO/94/23701 and WO 
96/06638). The amount of active compound contained within a sustained 

30 release formulation depends upon the site of implantation, the rate and 
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expected duration of release and the nature of the condition to be treated or 
prevented. 

In another illustrative embodiment, biodegradable microspheres 
(e.g., polylactate polyglycolate) are employed as carriers for the compositions 

5 of this invention. Suitable biodegradable microspheres are disclosed, for 
example, in U.S. Patent Nos. 4,897,268; 5,075,109; 5,928,647; 5,81 1 ,128; 
5,820,883; 5,853,763; 5,814,344, 5,407,609 and 5,942,252. Modified hepatitis 
B core protein carrier systems, such as described in WO/99 40934, and 
references cited therein, will also be useful for many applications. Another 

1 0 illustrative carrier/delivery system employs a earner comprising particulate- 
protein complexes, such as those described in U.S. Patent No. 5,928,647, 
which are capable of inducing a class l-restricted cytotoxic T lymphocyte 

responses in a host. 

In another illustrative embodiment, calcium phosphate core 
1 5 particles are employed as carriers, vaccine adjuvants, or as controlled release 
matrices for the compositions of this invention. Exemplary calcium phosphate 
particles are disclosed, for example, in published patent application No. 
WO/0046147. 

The pharmaceutical compositions of the invention will often 
20 further comprise one or more buffers (e.g., neutral buffered saline or phosphate 
buffered saline), carbohydrates (e.g., glucose, mannose, sucrose or dextrans), 
mannitol, proteins, polypeptides or amino acids such as glycine, antioxidants, 
bacteriostats, chelating agents such as EDTA or glutathione, adjuvants (e.g., 
aluminum hydroxide), solutes that render the formulation isotonic, hypotonic or 
25 weakly hypertonic with the blood of a recipient, suspending agents, thickening 
agents and/or preservatives. Alternatively, compositions of the present 
invention may be formulated as a lyophilizate. 

The pharmaceutical compositions described herein may be 
presented in unit-dose or multi-dose containers, such as sealed ampoules or 
30 vials. Such containers are typically sealed in such a way to preserve the 
sterility and stability of the formulation until use. In general, formulations may 
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be stored as suspensions, solutions or emulsions in oily or aqueous vehicles. 
Alternatively, a pharmaceutical composition may be stored in a freeze-dried 
condition requiring only the addition of a sterile liquid carrier immediately prior 
to use. 

5 The development of suitable dosing and treatment regimens for 

using the particular compositions described herein in a variety of treatment 
regimens, including e.g., oral, parenteral, intravenous, intranasal, and 
intramuscular administration and formulation, is well known in the art, some of 
which are briefly discussed below for general purposes of illustration. 

10 In certain applications, the pharmaceutical compositions 

disclosed herein may be delivered via oral administration to an animal. As 
such, these compositions may be formulated with an inert diluent or with an 
assimilable edible carrier, or they may be enclosed in hard- or soft-shell gelatin 
capsule, or they may be compressed into tablets, or they may be incorporated 

1 5 directly with the food of the diet 

The active compounds may even be incorporated with excipients 
and used in the form of ingestible tablets, buccal tables, troches, capsules, 
elixirs, suspensions, syrups, wafers, and the like (see, for example, Mathiowitz 
et a/., Nature 1 997 Mar 27;386(6623):41 0-4; Hwang et a/., Crit Rev Ther Drug 

20 Carrier Syst 1998;15(3):243-84; U.S. Patent 5,641 ,51 5; U.S. Patent 5,580,579 
and U.S. Patent 5,792,451 )U.STablets, troches, pills, capsules and the like 
may also contain any of a variety of additional components, for example, a 
binder, such as gum tragacanth, acacia, cornstarch, or gelatin; excipients, such 
as dicalcium phosphate; a disintegrating agent, such as com starch, potato 

25 starch, alginic acid and the like; a lubricant, such as magnesium stearate; and a 
sweetening agent, such as sucrose, lactose or saccharin may be added or a 
flavoring agent, such as peppermint, oil of wintergreen, or cherry flavoring. 
When the dosage unit form is a capsule, it may contain, in addition to materials 
of the above type, a liquid carrier. Various other materials may be present as 

30 coatings or to otherwise modify the physical form of the dosage unit. For 
instance, tablets, pills, or capsules may be coated with shellac, sugar, or both. 
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Of course, any material used in preparing any dosage unit form should be 
pharmaceutically pure and substantially non-toxic in the amounts employed. In 
addition, the active compounds may be incorporated into sustained-release 
preparation and formulations. 
5 Typically, these formulations will contain at least about 0. 1 % of 

the active compound or more, although the percentage of the active 
ingredients) may, of course, be varied and may conveniently be between 
about 1 or 2% and about 60% or 70% or more of the weight or volume of the 
total formulation. Naturally, the amount of active compound (s) in each 
1 0 therapeutically useful composition may be prepared is such a way that a 
suitable dosage will be obtained in any given unit dose of the compound. 
Factors such as solubility, bioavailability, biological half-life, route of 
administration, product shelf life, as well as other pharmacological 
considerations will be contemplated by one skilled in the art of preparing such 
1 5 pharmaceutical formulations, and as such, a variety of dosages and treatment 
regimens may be desirable. 

For oral administration the compositions of the present invention 
may alternatively be incorporated with one or more excipients in the form of a 
mouthwash, dentifrice, buccal tablet, oral spray, or sublingual orally- 
20 administered formulation. Alternatively, the active ingredient may be 
incorporated into an oral solution such as one containing sodium borate, 
glycerin and potassium bicarbonate, or dispersed in a dentifrice, or added in a 
therapeutically-effective amount to a composition that may include water, 
binders, abrasives, flavoring agents, foaming agents, and humectants. 
25 Aitematively the compositions may be fashioned into a tablet or solution form 
that may be placed under the tongue or otherwise dissolved in the mouth. 

In certain circumstances it will be desirable to deliver the 
pharmaceutical compositions disclosed herein parenterally, intravenously, 
intramuscularly, or even intraper'rtoneally. Such approaches are well known to 
30 the skilled artisan, some of which are further described, for example, in U.S. 
Patent 5,543,158; U.S. Patent 5,641 ,515 and U.S. Patent 5,399,363. In certain 
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embodiments, solutions of the active compounds as free base or 
pharmacologically acceptable salts may be prepared in water suitably mixed 
with a surfactant, such as hydroxypropylcellulose. Dispersions may also be 
prepared in glycerol, liquid polyethylene glycols, and mixtures thereof and in 
5 oils. Under ordinary conditions of storage and use, these preparations 

generally will contain a preservative to prevent the growth of microorganisms. 

Illustrative pharmaceutical forms suitable for injectable use 
include sterile aqueous solutions or dispersions and sterile powders for the 
extemporaneous preparation of sterile injectable solutions or dispersions (for 

1 0 example, see U.S. Patent 5,466,468). In all cases the form must be sterile and 
must be fluid to the extent that easy syringability exists. It must be stable under 
the conditions of manufacture and storage and must be preserved against the 
contaminating action of microorganisms, such as bacteria and fungi. The 
carrier can be a solvent or dispersion medium containing, for example, water, 

15 ethanol, polyol (e.g., glycerol, propylene glycol, and liquid polyethylene glycol, 
and the like), suitable mixtures thereof, and/or vegetable oils. Proper fluidity 
may be maintained, for example, by the use of a coating, such as lecithin, by 
the maintenance of the required particle size in the case of dispersion and/or by 
the use of surfactants. The prevention of the action of microorganisms can be 

20 facilitated by various antibacterial and antifungal agents, for example, 

parabens, chlorobutanol, phenol, sorbic acid, thimerosal, and the like. In many 
cases, it will be preferable to include isotonic agents, for example, sugars or 
sodium chloride. Prolonged absorption of the injectable compositions can be 
brought about by the use in the compositions of agents delaying absorption, for 

25 example, aluminum monostearate and gelatin. 

In one embodiment, for parenteral administration in an aqueous 
solution, the solution should be suitably buffered if necessary and the liquid 
diluent first rendered isotonic with sufficient saline or glucose. These particular 
aqueous solutions are especially suitable for intravenous, intramuscular, 

30 subcutaneous and intraperitoneal administration. In this connection, a sterile 
aqueous medium that can be employed will be known to those of skill in the art 
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in light of the present disclosure. For example, one dosage may be dissolved 
in 1 ml of isotonic NaCI solution and either added to 1000 ml of 
hypodermoclysis fluid or injected at the proposed site of infusion, (see for 
example, "Remington's Pharmaceutical Sciences" 15th Edition, pages 1035- 

5 1 038 and 1 570-1 580). Some variation in dosage will necessarily occur 
depending on the condition of the subject being treated. Moreover, for human 
administration, preparations will of course preferably meet sterility, pyrogenicity, 
and the general safety and purity standards as required by FDA Office of 
Biologies standards. 

1 o In another embodiment of the invention, the compositions 

disclosed herein may be formulated in a neutral or salt form. Illustrative 
pharmaceutically-acceptable salts include the acid addition salts (formed with 
the free amino groups of the protein) and which are formed with inorganic acids 
such as, for example, hydrochloric or phosphoric acids, or such organic acids 

15 as acetic, oxalic, tartaric, mandelic, and the like. Salts formed with the free 
carboxyl groups can also be derived from inorganic bases such as, for 
example, sodium, potassium, ammonium, calcium, or ferric hydroxides, and 
such organic bases as isopropylamine, trimethylamine, histidine, procaine and 
the like. Upon formulation, solutions will be administered in a manner 

20 compatible with the dosage formulation and in such amount as is 
therapeutically effective. 

The carriers can further comprise any and all solvents, dispersion 
media, vehicles, coatings, diluents, antibacterial and antifungal agents, isotonic 
and absorption delaying agents, buffers, carrier solutions, suspensions, 

25 colloids, and the like. The use of such media and agents for pharmaceutical 
active substances is well known in the art. Except insofar as any conventional 
media or agent is incompatible with the active ingredient, its use in the 
therapeutic compositions is contemplated. Supplementary active ingredients 
can also be incorporated into the compositions. The phrase 

30 n pharmaceutically-acceptable u refers to molecular entities and compositions 
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that do not produce an allergic or similar untoward reaction when administered 
to a human. 

In certain embodiments, the pharmaceutical compositions may be 
delivered by intranasal sprays, inhalation, and/or other aerosol delivery 
5 vehicles. Methods for delivering genes, nucleic acids, and peptide 

compositions directly to the lungs via nasal aerosol sprays has been described, 
e.g., in U.S. Patent 5,756,353 and U.S. Patent 5,804,212. Likewise, the 
delivery of drugs using intranasal microparticle resins (Takenaga e/a/., J 
Controlled Release 1998 Mar 2;52(1-2):81-7) and lysophosphatidyl-glycerol 
10 compounds (U.S. Patent 5,725,871) are also well-known in the pharmaceutical 
arts. Likewise, illustrative transmucosal drug delivery in the form of a 
polytetrafluoroetheylene support matrix is described in U.S. Patent 5,780,045. 

In certain embodiments, liposomes, nanocapsules, microparticles, 
lipid particles, vesicles, and the like, are used for the introduction of the 
1 5 compositions of the present invention into suitable host cells/organisms. In 
particular, the compositions of the present invention may be formulated for 
delivery either encapsulated in a lipid particle, a liposome, a vesicle, a 
nanosphere, or a nanoparticle or the like. Alternatively, compositions of the 
present invention can be bound, either covalentiy or non-covalently, to the 
20 surface of such carrier vehicles. 

The formation and use of liposome and liposome-like 
preparations as potential drug carriers is generally known to those of skill in the 
art (see for example, Lasic, Trends Biotechnol 1998 Jul;16(7):307-21; 
Takakura, Nippon Rinsho 1998 Mar;56(3):691-5; Chandran et a/., Indian J Exp 
25 Biol. 1 997 Aug;35(8):801 -9; Margalit, Cr'rt Rev Ther Drug Carrier Syst 
1995;12(2-3):233-61; U.S. Patent 5,567,434; U.S. Patent 5,552,157; U.S. 
Patent 5,565,213; U.S. Patent 5,738,868 and U.S. Patent 5,795,587, each 
specifically incorporated herein by reference in its entirety). 

Liposomes have been used successfully with a number of cell 
30 types that are normally difficult to transfect by other procedures, including T cell 
suspensions, primary hepatocyte cultures and PC 12 cells (Renneisen et a/., J 
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Biol Chem. 1990 Sep 25;265(27):1 6337-42; Muller et a/., DNA Cell Biol. 1990 
Apr;9(3):221-9). In addition, liposomes are free of the DNA length constraints 
that are typical of viral-based delivery systems. Liposomes have been used 
effectively to introduce genes, various drugs, radiotherapeutic agents, 
5 enzymes, viruses, transcription factors, allosteric effectors and the like, into a 
variety of cultured cell lines and animals. Furthermore, he use of liposomes 
does not appear to be associated with autoimmune responses or unacceptable 
toxicity after systemic delivery. 

In certain embodiments, liposomes are formed from 
10 phospholipids that are dispersed in an aqueous medium and spontaneously 
form multilamellar concentric bilayer vesicles (also termed multilamellar 
vesicles (MLVs). 

Alternatively, in other embodiments, the invention provides for 
pharmaceutically-acceptable nanocapsule formulations of the compositions of 
15 the present invention. Nanocapsules can generally entrap compounds in a 
stable and reproducible way (see, for example, Quintanar-Guerrero et a/., Drug 
Dev Ind Pharm. 1 998 Dec;24(1 2):1 1 1 3-28). To avoid side effects due to 
intracellular polymeric overloading, such ultrafine particles (sized around 0.1 
\xm) may be designed using polymers able to be degraded in vivo. Such 
20 particles can be made as described, for example, by Couvreur et a/., Crit Rev 
Ther Drug Carrier Syst. 1988;5(1):1-20; zur Muhlen et a/., Eur J Pharm 
Biopharm. 1998 Mar;45(2):1 49-55; Zambaux et a/. J Controlled Release. 1998 
Jan 2;50(1-3):31-40; and U.S. Patent 5,145,684. 

Cancer Therapeutic Methods 

25 Immunologic approaches to cancer therapy are based on the 

recognition that cancer cells can often evade the body's defenses against 
aberrant or foreign cells and molecules, and that these defenses might be 
therapeutically stimulated to regain the lost ground, e.g., pgs. 623-648 in Klein, 
Immunology (Wiley-lnterscience, New York, 1982). Numerous recent 

30 observations that various immune effectors can directly or indirectly inhibit 
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growth of tumors has led to renewed interest in this approach to cancer 
therapy, e.g., Jager, et aL, Oncology 2001 ;60(1):1 -7; Renner, et al., Ann 
Hematol 2000 Dec;79(12):651-9. 

Four-basic cell types whose function has been associated with 
5 antitumor cell immunity and the elimination of tumor cells from the body are: i) 
B-lymphocytes which secrete immunoglobulins into the blood plasma for 
identifying and labeling the nonself invader cells; ii) monocytes which secrete 
the complement proteins that are responsible for lysing and processing the 
immunoglobulin-coated target invader cells; iii) natural killer lymphocytes 

1 0 having two mechanisms for the destruction of tumor cells, antibody-dependent 
cellular cytotoxicity and natural killing; and iv) T-lymphocytes possessing 
antigen-specific receptors and having the capacity to recognize a tumor cell 
carrying complementary marker molecules (Schreiber, H., 1989, in 
Fundamental Immunology (ed.) W. E. Paul, pp. 923-955). 

1 5 Cancer immunotherapy generally focuses on inducing humoral 

immune responses, cellular immune responses, or both. Moreover, it is well 
established that induction of CD4 + T helper cells is necessary in order to 
secondarily induce either antibodies or cytotoxic CD8 + T cells. Polypeptide 
antigens that are selective or ideally specific for cancer cells, particularly breast 

20 cancer cells, offer a powerful approach for inducing immune responses against 
breast cancer, and are an important aspect of the present invention. 

Therefore, in further aspects of the present invention, the 
pharmaceutical compositions described herein may be used to stimulate an 
immune response against cancer, particularly for the immunotherapy of breast 

25 cancer. Within such methods, the pharmaceutical compositions described 
herein are administered to a patient, typically a warm-blooded animal, 
preferably a human. A patient may or may not be afflicted with cancer. 
Pharmaceutical compositions and vaccines may be administered either prior to 
or following surgical removal of primary tumors and/or treatment such as 

30 administration of radiotherapy or conventional chemotherapeutic drugs. As 
discussed above, administration of the pharmaceutical compositions may be by 
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any suitable method, including administration by intravenous, intraperitoneal, 
intramuscular, subcutaneous, intranasal, intradermal, anal, vaginal, topical and 
oral routes. 

Within certain embodiments, immunotherapy may be active 

5 immunotherapy, in which treatment relies on the in vivo stimulation of the 
endogenous host immune system to react against tumors with the 
administration of immune response-modifying agents (such as polypeptides 
and polynucleotides as provided herein). 

Within other embodiments, immunotherapy may be passive > 

1 0 immunotherapy, in which treatment involves the delivery of agents with 

established tumor-immune reactivity (such as effector cells or antibodies) that 
can directly or indirectly mediate antitumor effects and does not necessarily 
depend on an intact host immune system. Examples of effector cells include T 
cells as discussed above, T lymphocytes (such as CD8 + cytotoxic T 

1 5 lymphocytes and CD4 + T-helper tumor-infiltrating lymphocytes) , killer cells 

(such as Natural Killer cells and lymphokine-activated killer cells), B cells and i 4 
antigen-presenting cells (such as dendritic cells and macrophages) expressing 
a polypeptide provided herein. T cell receptors and antibody receptors specific 
for the polypeptides recited herein may be cloned, expressed and transferred 

20 into other vectors or effector cells for adoptive immunotherapy. The 

polypeptides provided herein may also be used to generate antibodies or anti- 
idiotype antibodies (as described above and in U.S. Patent No. 4,918,164) for 
passive immunotherapy. 

Monoclonal antibodies may be labeled with any of a variety of 

25 labels for desired selective usages in detection, diagnostic assays or 
therapeutic applications (as described in U.S. Patent Nos. 6,090,365; 
6,01 5,542; 5,843,398; 5,595,721 ; and 4,708,930, hereby incorporated by 
reference in their entirety as if each was incorporated individually). In each 
case, the binding of the labelled monoclonal antibody to the determinant site of 

30 the antigen will signal detection or delivery of a particular therapeutic agent to 
the antigenic determinant on the non-normal cell. A further object of this 
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invention is to provide the specific monoclonal antibody suitably labelled for 
achieving such desired selective usages thereof. 

Effector cells may generally be obtained in sufficient quantities for 
adoptive immunotherapy by growth in vitro, as described herein. Culture 

5 conditions for expanding single antigen-specific effector cells to several billion 
in number with retention of antigen recognition in vivo are well known in the art. 
Such in vitro culture conditions typically use intermittent stimulation with 
antigen, often in the presence of cytokines (such as IL-2) and non-dividing 
feeder cells. As noted above, immunoreactive polypeptides as provided herein 

1 0 may be used to rapidly expand antigen-specific T cell cultures in order to 

generate a sufficient number of cells for immunotherapy. In particular, antigen- 
presenting cells, such as dendritic, macrophage, monocyte, fibroblast and/or B 
cells, may be pulsed with immunoreactive polypeptides or transfected with one 
or more polynucleotides using standard techniques well known in the art. For 

1 5 example, antigen-presenting cells can be transfected with a polynucleotide 
having a promoter appropriate for increasing expression in a recombinant virus 
or other expression system. Cultured effector cells for use in therapy must be 
able to grow and distribute widely, and to survive long term in vivo. Studies 
have shown that cultured effector cells can be induced to grow in vivo and to 

20 survive long term in substantial numbers by repeated stimulation with antigen 
supplemented with IL-2 (see, for example, Cheever et al., Immunological 
Reviews 757:177, 1997). 

Alternatively, a vector expressing a polypeptide recited herein 
may be introduced into antigen presenting cells taken from a patient and 

25 clonally propagated ex vivo for transplant back into the same patient. 

Transfected cells may be reintroduced into the patient using any means known 
in the art, preferably in sterile form by intravenous, intracavitary, intraperitoneal 
or intratumor administration. 

Routes and frequency of administration of the therapeutic 

30 compositions described herein, as well as dosage, will vary from individual to 
individual, and may be readily established using standard techniques. In 
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general, the pharmaceutical compositions and vaccines may be administered 
by injection (e.g., intracutaneous, intramuscular, intravenous or subcutaneous), 
intranasal^ (e.g., by aspiration) or orally. Preferably, between 1 and 10 doses 
may be administered over a 52 week period. Preferably, 6 doses are 
5 administered, at intervals of 1 month, and booster vaccinations may be given 
periodically thereafter. Alternate protocols may be appropriate for individual 
patients. A suitable dose is an amount of a compound that, when administered 
as described above, is capable of promoting an anti-tumor immune response, 
and is at least 1 0-50% above the basal (/.e M untreated) level. Such response 
1 0 can be monitored by measuring the anti-tumor antibodies in a patient or by 
vaccine-dependent generation of cytolytic effector cells capable of killing the 
patients tumor cells in vitro. Such vaccines should also be capable of causing 
an immune response that leads to an improved clinical outcome (e.g., more 
frequent remissions, complete or partial or longer disease-free survival) in 
15 vaccinated patients as compared to non-vaccinated patients. In general, for 
pharmaceutical compositions and vaccines comprising one or more 
polypeptides, the amount of each polypeptide present in a dose ranges from 
about 25 ng to 5 mg per kg of host. Suitable dose sizes will vary with the size 
of the patient, but will typically range from about 0.1 mL to about 5 mL 
20 In general, an appropriate dosage and treatment regimen 

provides the active compound(s) in an amount sufficient to provide therapeutic 
and/or prophylactic benefit. Such a response can be monitored by establishing 
an improved clinical outcome (e.g., more frequent remissions, complete or 
partial, or longer disease-free survival) in treated patients as compared to non- 
25 treated patients. Increases in preexisting immune responses to a tumor protein 
generally correlate with an improved clinical outcome. Such immune 
responses may generally be evaluated using standard proliferation, cytotoxicity 
or cytokine assays, which may be performed using samples obtained from a 
patient before and after treatment. 
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Cancer Detection and Diagnostic Compositions, Methods and Kits 

In general, a cancer may be detected in a patient based on the 
presence of one or more breast tumor proteins and/or polynucleotides 
encoding such proteins in a biological sample (for example, blood, sera, 
5 sputum urine and/or tumor biopsies) obtained from the patient. In other words, 
such proteins may be used as markers to indicate the presence or absence of 
a cancer such as breast cancer. In addition, such proteins may be useful for 
the detection of other cancers. The binding agents provided herein generally 
permit detection of the level of antigen that binds to the agent in the biological 
10 sample. 

Polynucleotide primers and probes may be used to detect the 
level of mRNA encoding a tumor protein, which is also indicative of the 
presence or absence of a cancer. In general, a tumor sequence should be 
present at a level that is at least two-fold, preferably three-fold, and more 

1 5 preferably five-fold or higher in tumor tissue than in normal tissue of the same 
type from which the tumor arose. Expression levels of a particular tumor 
sequence in tissue types different from that in which the tumor arose are 
irrelevant in certain diagnostic embodiments since the presence of tumor cells 
can be confirmed by observation of predetermined differential expression 

20 levels, e.g., 2-fold, 5-fold, etc, in tumor tissue to expression levels in normal 
tissue of the same type. 

Other differential expression patterns can be utilized 
advantageously for diagnostic purposes. For example, in one aspect of the 
invention, overexpression of a tumor sequence in tumor tissue and normal 

25 tissue of the same type, but not in other normal tissue types, e.g., PBMCs, can 
be exploited diagnostically. In this case, the presence of metastatic tumor 
cells, for example in a sample taken from the circulation or some other tissue 
site different from that in which the tumor arose, can be identified and/or 
confirmed by detecting expression of the tumor sequence in the sample, for 

30 example using RT-PCR analysis. In many instances, it will be desired to enrich 



95 



WO030 1 3431 rhttp://www.qetthepatent.com/Lo qi n .dog/Spate nts4/@21 0121 %2E41 9C 1 2/Fetch/WO03Q 1 3431 .cpc?f romCache=1 part;^^ 
WO 03/013431 PCT/US02/24917 

for tumor cells in the sample of interest, e.g., PBMCs, using cell capture or 
other like techniques. 

There are a variety of assay formats known to those of ordinary 
skill in the art for using a binding agent to detect polypeptide markers in a 
5 sample. See, e.g., Harlow and Lane, Antibodies: A Laboratory Manual, Cold 
Spring Harbor Laboratory, 1988. In general, the presence or absence of a 
canoer in a patient may be determined by (a) contacting a biological sample 
obtained from a patient with a binding agent; (b) detecting in the sample a level 
of polypeptide that binds to the binding agent; and (c) comparing the level of 

1 0 polypeptide with a predetermined cut-off value. 

In a preferred embodiment, the assay involves the use of binding 
agent immobilized on a solid support to bind to and remove the polypeptide 
from the remainder of the sample. The bound polypeptide may then be 
detected using a detection reagent that contains a reporter group and 

1 5 specifically binds to the binding agent/polypeptide complex. Such detection 
reagents may comprise, for example, a binding agent that specifically binds to 
the polypeptide or an antibody or other agent that specifically binds to the 
binding agent, such as an anti-immunoglobulin, protein G, protein A or a lectin. 
Alternatively, a competitive assay may be utilized, in which a polypeptide is 

20 labeled with a reporter group and allowed to bind to the immobilized binding 
agent after incubation of the binding agent with the sample. The extent to 
which components of the sample inhibit the binding of the labeled polypeptide 
to the binding agent is indicative of the reactivity of the sample with the 
immobilized binding agent. Suitable polypeptides for use within such assays 

25 include full length breast tumor proteins and polypeptide portions thereof to 
which the binding agent binds, as described above. 

The solid support may be any material known to those of ordinary 
skill in the art to which the tumor protein may be attached. For example, the 
solid support may be a test well in a microtiter plate or a nitrocellulose or other 

30 suitable membrane. Alternatively, the support may be a bead or disc, such as 
glass, f iberglass, latex or a plastic material such as polystyrene or 
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polyvinylchloride. The support may also be a magnetic particle or a fiber optic 
sensor, such as those disclosed, for example, in U.S. Patent No. 5,359,681 . 
The binding agent may be immobilized on the solid support using a variety of 
techniques known to those of skill in the art, which are amply described in the 

5 patent and scientific literature. In the context of the present invention, the term 
"immobilization" refers to both noncovalent association, such as adsorption, 
and covalent attachment (which may be a direct linkage between the agent and 
functional groups on the support or may be a linkage by way of a cross-linking 
agent). Immobilization by adsorption to a well in a microtiter plate or to a 

1 0 membrane is preferred. In such cases, adsorption may be achieved by 
contacting the binding agent, in a suitable buffer, with the solid support for a 
suitable amount of time. The contact time varies with temperature, but is 
typically between about 1 hour and about 1 day. In general, contacting a well 
of a plastic microtiter plate (such as polystyrene or polyvinylchloride) with an 

1 5 amount of binding agent ranging from about 1 0 ng to about 1 0 vq, and 

preferably about 100 ng to about 1 jt/g, is sufficient to immobilize an adequate 

amount of binding agent. 

Covalent attachment of binding agent to a solid support may 
generally be achieved by first reacting the support with a bifunctional reagent 

20 that will react with both the support and a functional group, such as a hydroxyl 
or amino group, on the binding agent. For example, the binding agent may be 
covalently attached to supports having an appropriate polymer coating using 
benzoquinone or by condensation of an aldehyde group on the support with an 
amine and an active hydrogen on the binding partner (see, e.g., Pierce 

25 Immunotechnology Catalog and Handbook, 1 991 , at A1 2-A1 3). 

In certain embodiments, the assay is a two-antibody sandwich 
assay. This assay may be performed by first contacting an antibody that has 
been immobilized on a solid support, commonly the well of a microtiter plate, 
with the sample, such that polypeptides within the sample are allowed to bind 

30 to the immobilized antibody. Unbound sample is then removed from the 
immobilized polypeptide-antibody complexes and a detection reagent 
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(preferably a second antibody capable of binding to a different site on the 
polypeptide) containing a reporter group is added. The amount of detection 
reagent that remains bound to the solid support is then determined using a 
method appropriate for the specific reporter group. 

5 More specifically, once the antibody is immobilized on the support 

as described above, the remaining protein binding sites on the support are 
typically blocked. Any suitable blocking agent known to those of ordinary skill 
in the art, such as bovine serum albumin or Tween 20™ (Sigma Chemical Co., 
St. Louis, MO). The immobilized antibody is then incubated with the sample, 

10 and polypeptide is allowed to bind to the antibody. The sample may be diluted 
with a suitable diluent, such as phosphate-buffered saline (PBS) prior to 
incubation. In general, an appropriate contact time (/.e M incubation time) is a 
period of time that is sufficient to detect the presence of polypeptide within a 
sample obtained from an individual with breast cancer at least about 95% of 

15 that achieved at equilibrium between bound and unbound polypeptide. Those 
of ordinary skill in the art will recognize that the time necessary to achieve 
equilibrium may be readily determined by assaying the level of binding that 
occurs over a period of time. At room temperature, an incubation time of about 
30 minutes is generally sufficient. 

20 Unbound sample may then be removed by washing the solid 

support with an appropriate buffer, such as PBS containing 0.1 % Tween 20™ . 
The second antibody, which contains a reporter group, may then be added to 
the solid support. Preferred reporter groups include those groups recited 
above. 

25 The detection reagent is then incubated with the immobilized 

antibody-polypeptide complex for an amount of time sufficient to detect the 
bound polypeptide. An appropriate amount of time may generally be 
determined by assaying the level of binding that occurs over a period of time. 
Unbound detection reagent is then removed and bound detection reagent is 

30 detected using the reporter group. The method employed for detecting the 
reporter group depends upon the nature of the reporter group. For radioactive 
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groups, scintillation counting or autoradiographic methods are generally 
appropriate. Spectroscopic methods may be used to detect dyes, luminescent 
groups and fluorescent groups. Biotin may be detected using avidin, coupled 
to a different reporter group (commonly a radioactive or fluorescent group or an 
5 enzyme). Enzyme reporter groups may generally be detected by the addition 
of substrate (generally for a specific period of time), followed by spectroscopic 
or other analysis of the reaction products. 

To determine the presence or absence of a cancer, such as 
breast cancer, the signal detected from the reporter group that remains bound 

10 to the solid support is generally compared to a signal that corresponds to a 
predetermined cut-off value. In one preferred embodiment, the cut-off value for 
the detection of a cancer is the average mean signal obtained when the 
immobilized antibody is incubated with samples from patients without the 
cancer. In general, a sample generating a signal that is three standard 

1 5 deviations above the predetermined cut-off value is considered positive for the 
cancer. In an alternate preferred embodiment, the cut-off value is determined 
using a Receiver Operator Curve, according to the method of Sackett et al., 
Clinical Epidemiology: A Basic Science for Clinical Medicine, Little Brown and 
Co., 1985, p. 106-7. Briefly, in this embodiment, the cut-off value may be 

20 determined from a plot of pairs of true positive rates (/.e. f sensitivity) and false 
positive rates (1 00%-specif icity) that correspond to each possible cut-off value 
for the diagnostic test result. The cut-off value on the plot that is the closest to 
the upper left-hand corner (i.e., the value that encloses the largest area) is the 
most accurate cut-off value, and a sample generating a signal that is higher 

25 than the cut-off value determined by this method may be considered positive. 
Alternatively, the cut-off value may be shifted to the left along the plot, to 
minimize the false positive rate, or to the right, to minimize the false negative 
rate. In general, a sample generating a signal that is higher than the cut-off 
value determined by this method is considered positive for a cancer. 

30 In a related embodiment, the assay is performed in a flow-through 

or strip test format, wherein the binding agent is immobilized on a membrane, 
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such as nitrocellulose. In the flow-through test, polypeptides within the sample 
bind to the immobilized binding agent as the sample passes through the 
membrane. A second, labeled binding agent then binds to the binding agent- 
polypeptide complex as a solution containing the second binding agent flows 
5 through the membrane. The detection of bound second binding agent may 
then be performed as described above. In the strip test format, one end of the 
membrane to which binding agent is bound is immersed in a solution containing 
the sample. The sample migrates along the membrane through a region 
containing second binding agent and to the area of immobilized binding agent. 

1 0 Concentration of second binding agent at the area of immobilized antibody 
indicates the presence of a cancer. Typically, the concentration of second 
binding agent at that site generates a pattern, such as a line, that can be read 
visually. The absence of such a pattern indicates a negative result. In general, 
the amount of binding agent immobilized on the membrane is selected to 

1 5 generate a visually discernible pattern when the biological sample contains a 
level of polypeptide that would be sufficient to generate a positive signal in the 
two-antibody sandwich assay, in the format discussed above. Preferred 
binding agents for use in such assays are antibodies and antigen-binding 
fragments thereof. Preferably, the amount of antibody immobilized on the 

20 membrane ranges from about 25 ng to about 1 ng, and more preferably from 
about 50 ng to about 500 ng. Such tests can typically be performed with a very 
small amount of biological sample. 

Of course, numerous other assay protocols exist that are suitable 
for use with the tumor proteins or binding agents of the present invention. The 

25 above descriptions are intended to be exemplary only. For example, it will be 
apparent to those of ordinary skill in the art that the above protocols may be 
readily modified to use tumor polypeptides to detect antibodies that bind to 
such polypeptides in a biological sample. The detection of such tumor protein 
specific antibodies may correlate with the presence of a cancer. 

30 A cancer may also, or alternatively, be detected based on the 

presence of T cells that specifically react with a tumor protein in a biological 
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sample. Within certain methods, a biological sample comprising CD4 + and/or 
CD8 + T cells isolated from a patient is incubated with a tumor polypeptide, a 
polynucleotide encoding such a polypeptide and/or an APC that expresses at 
least an immunogenic portion of such a polypeptide, and the presence or 
5 absence of specific activation of the T cells is detected. Suitable biological 
samples include, but are not limited to, isolated T cells. For example, T cells 
may be isolated from a patient by routine techniques (such as by 
Ficoll/Hypaque density gradient centrifugation of peripheral blood lymphocytes). 
T cells may be incubated in vitro for 2-9 days (typically 4 days) at 37°C with 

10 polypeptide (e.g., 5 - 25 jxg/ml). It may be desirable to incubate another aliquot 
of a T cell sample in the absence of tumor polypeptide to serve as a control. 
For CD4 + T cells, activation is preferably detected by evaluating proliferation of 
the T cells. For CD8 + T cells, activation is preferably detected by evaluating 
cytolytic activity. A level of proliferation that is at least two fold greater and/or a 

15 level of cytolytic activity that is at least 20% greater than in disease-free 
patients indicates the presence of a cancer in the patient. 

As noted above, a cancer may also, or alternatively, be detected 
based on the level of mRNA encoding a tumor protein in a biological sample. 
For example, at least two oligonucleotide primers may be employed in a 

20 polymerase chain reaction (PCR) based assay to amplify a portion of a tumor 
cDNA derived from a biological sample, wherein at least one of the 
oligonucleotide primers is specific for (/.e., hybridizes to) a polynucleotide 
encoding the tumor protein. The amplified cDNA is then separated and 
detected using techniques well known in the art, such as gel electrophoresis. 

25 Similarly, oligonucleotide probes that specifically hybridize to a 

polynucleotide encoding a tumor protein may be used in a hybridization assay 
to detect the presence of polynucleotide encoding the tumor protein in a 
biological sample. 

To permit hybridization under assay conditions, oligonucleotide 

30 primers and probes should comprise an oligonucleotide sequence that has at 
least about 60%, preferably at least about 75% and more preferably at least 
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about 90%, identity to a portion of a polynucleotide encoding a tumor protein of 
the invention that is at least 10 nucleotides, and preferably at least 20 
nucleotides, in length. Preferably, oligonucleotide primers and/or probes 
hybridize to a polynucleotide encoding a polypeptide described herein under 
5 moderately stringent conditions, as defined above. Oligonucleotide primers 
and/or probes which may be usefully employed in the diagnostic methods 
described herein preferably are at least 10-40 nucleotides in length. In a 
preferred embodiment, the oligonucleotide primers comprise at least 10 
contiguous nucleotides, more preferably at least 15 contiguous nucleotides, of 
10 a DNA molecule having a sequence as disclosed herein. Techniques for both 
PCR based assays and hybridization assays are well known in the art (see, for 
example, Mullis et al., Cold Spring Harbor Symp. Quant. Biol., 51:263, 1987; 
Erlich ed., PCR Technology, Stockton Press, NY, 1989). 

One preferred assay employs RT-PCR, in which PCR is applied 
1 5 in conjunction with reverse transcription. Typically, RNA is extracted from a 
biological sample, such as biopsy tissue, and is reverse transcribed to produce 
cDNA molecules. PCR amplification using at least one specific primer 
generates a cDNA molecule, which may be separated and visualized using, for 
example, gel electrophoresis. Amplification may be performed on biological 
20 samples taken from a test patient and from an individual who is not afflicted 
with a cancer. The amplification reaction may be performed on several 
dilutions of cDNA spanning two orders of magnitude. A two-fold or greater 
increase in expression in several dilutions of the test patient sample as 
compared to the same dilutions of the non-cancerous sample is typically 
25 considered positive. 

In another aspect of the present invention, cell capture 
technologies may be used in conjunction, with, for example, real-time PCR to 
provide a more sensitive tool for detection of metastatic cells expressing breast 
tumor antigens. Detection of breast cancer cells in biological samples, e.g., 
30 bone marrow samples, peripheral blood, and small needle aspiration samples 
is desirable for diagnosis and prognosis in breast cancer patients. 
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Immunomagnetic beads coated with specific monoclonal 
antibodies to surface cell markers, or tetrameric antibody complexes, may be 
used to first enrich or positively select cancer cells in a sample. Various 
commercially available kits may be used, including Dynabeads® Epithelial 
5 Enrich (Dynal Biotech, Oslo, Norway), StemSep™ (StemCell Technologies, 
Inc., Vancouver, BC), and RosetteSep (StemCell Technologies). A skilled 
artisan will recognize that other methodologies and kite may also be used to 
enrich or positively select desired cell populations. Dynabeads® Epithelial 
Enrich contains magnetic beads coated with mAbs specific for two glycoprotein 

1 0 membrane antigens expressed on normal and neoplastic epithelial tissues. 
The coated beads may be added to a sample and the sample then applied to a 
magnet, thereby capturing the cells bound to the beads. The unwanted cells 
are washed away and the magnetically isolated cells eluted from the beads and 
used in further analyses. 

1 5 RosetteSep can be used to enrich cells directly from a blood 

sample and consists of a cocktail of tetrameric antibodies that targets a variety 
of unwanted cells and crosslinks them to glycophorin A on red blood cells 
(RBC) present in the sample, forming rosettes. When centrifuged over Ficoll, 
targeted cells pellet along with the free RBC. The combination of antibodies in 

20 the depletion cocktail determines which cells will be removed and consequently 
which cells will be recovered. Antibodies that are available include, but are not 
limited to: CD2, CD3, CD4, CDS, CD8, CD10, CD11b, CD14, CD15, CD16, 
CD19, CD20, CD24, CD25, CD29, CD33, CD34, CD36, CD38, CD41, CD45, 
CD45RA, CD45RO, CD56, CD66B, CD66e, HLA-DR, IgE, and TCRap. 

25 Additionally, it is contemplated in the present invention that mAbs 

specific for breast tumor antigens can be generated and used in a similar 
manner. For example, mAbs that bind to tumor-specific cell surface antigens 
may be conjugated to magnetic beads, or formulated in a tetrameric antibody 
complex, and used to enrich or positively select metastatic breast tumor cells 

30 from a sample. Once a sample is enriched or positively selected, cells may be 
lysed and RNA isolated. RNA may then be subjected to RT-PCR analysis 
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using breast tumor-specific primers in a real-time PCR assay as described 
herein. One skilled in the art will recognize that enriched or selected 
populations of cells may be analyzed by other methods (e.g., in situ 

hybridization or flow cytometry). 
5 In another embodiment, the compositions described herein may 

be used as markers for the progression of cancer. In this embodiment, assays 
as described above for the diagnosis of a cancer may be performed overtime, 
and the change in the level of reactive polypeptide(s) or polynucleotide® 
evaluated. For example, the assays may be performed every 24-72 hours for a 
10 period of 6 months to 1 year, and thereafter performed as needed. In general, 
a cancer is progressing in those patients in whom the level of polypeptide or 
polynucleotide detected increases over time. In contrast, the cancer is not 
progressing when the level of reactive polypeptide or polynucleotide either 
remains constant or decreases with time. 
1 5 certain in vivo diagnostic assays may be performed directly on a 

tumor. One such assay involves contacting tumor cells with a binding agent. 
The bound binding agent may then be detected directly or indirectly via a 
reporter group. Such binding agents may also be used in histological 
applications. Alternatively, polynucleotide probes may be used within such 
20 applications. 

As noted above, to improve sensitivity, multiple tumor protein 
markers may be assayed within a given sample. It will be apparent that binding 
agents specific for different proteins provided herein may be combined within a 
single assay. Further, multiple primers or probes may be used concurrently. 

25 The selection of tumor protein markers may be based on routine experiments 
to determine combinations that results in optimal sensitivity. In addition, or 
alternatively, assays for tumor proteins provided herein may be combined with 
assays for other known tumor antigens. 

The present invention further provides kits for use within any of 

30 the above diagnostic methods. Such kits typically comprise two or more 
components necessary for performing a diagnostic assay. Components may 
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be compounds, reagents, containers and/or equipment. For example, one 
container within a kit may contain a monoclonal antibody or fragment thereof 
that specifically binds to a tumor protein. Such antibodies or fragments may be 
provided attached to a support material, as described above. One or more 
5 additional containers may enclose elements, such as reagents or buffers, to be 
used in the assay. Such kits may also, or alternatively, contain a detection 
reagent as described above that contains a reporter group suitable for direct or 
indirect detection of antibody binding. 

Alternatively, a kit may be designed to detect the level of mRNA 

1 0 encoding a tumor protein in a biological sample. Such kits generally comprise 
at least one oligonucleotide probe or primer, as described above, that 
hybridizes to a polynucleotide encoding a tumor protein. Such an 
oligonucleotide may be used, for example, within a PCR or hybridization assay. 
Additional components that may be present within such kits include a second 

1 5 oligonucleotide and/or a diagnostic reagent or container to facilitate the 
detection of a polynucleotide encoding a tumor protein. 

The following Examples are offered by way of illustration and not 
byway of limitation. 



EXAMPLES 



20 EXAMPLE 1 

Preparation of Breast Tumor-Specific cDNAs Using 
Differential Display RT-PCR 

This Example illustrates the preparation of cDNA molecules 
encoding breast tumor-specific polypeptides using a differential display screen. 

25 A. Preparation of B1 8Aq1 cDNA and Characterization of mRNA Expression 
Tissue samples were prepared from breast tumor and normal 
tissue of a patient with breast cancer that was confirmed by pathology after 
removal from the patient. Normal RNA and tumor RNA was extracted from the 
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samples and mRNA was isolated and converted into cDNA using a (dDiaAG 
(SEQ ID NO:1 30) anchored 3' primer. Differential display PCR was then 
executed using a randomly chosen primer (CTTCAACCTC) (SEQ ID NO:103). 
Amplification conditions were standard buffer containing 1.5 mM MgCI 2l 
5 20 pmol of primer, 500 pmol dNTP, and 1 unit of Taq DNA polymerase (Perkin- 
Elmer, Branchburg, NJ). Forty cycles of amplification were performed using 
94°C denaturation for 30 seconds, 42°C annealing for 1 minute, and 72°C 
extension for 30 seconds. An RNA fingerprint containing 76 amplified products 
was obtained. Although the RNA fingerprint of breast tumor tissue was over 
1 o 98% identical to that of the normal breast tissue, a band was repeatedly 

observed to be specific to the RNA fingerprint pattern of the tumor. This band 
was cut out of a silver stained gel, subcloned into the T-vector (Novagen, 

Madison, Wl) and sequenced. 

The sequence of the cDNA, referred to as B1 8Ag1 , is provided in 
15 SEQIDNO:1. A database search of GENBANK and EMBL revealed that the 
B1 8Ag1 fragment initially cloned is 77% identical to the endogenous human 
retroviral element S71 , which is a truncated retroviral element homologous to 
the Simian Sarcoma Virus (SSV). S71 contains an incomplete gag gene, a 
portion of the pel gene and an LTR-like structure at the 3' terminus (see 
20 Werner et al., Virology 174:225-238 (1990)). B1 8Ag1 is also 64% identical to 
SSV in the region corresponding to the P30 (gag) locus. B18Ag1 contains 
three separate and incomplete reading frames covering a region which shares 
considerable homology to a wide variety of gag proteins of retroviruses which 
infect mammals. In addition, the homology to S71 is not just within the gag 
25 gene, but spans several kb of sequence including an LTR. 

B18Ag1 -specific PCR primers were synthesized using computer 
analysis guidelines. RT-PCR amplification <94"C, 30 seconds; 60°C -* 42°C, 
30 seconds; 72°C, 30 seconds for 40 cycles) confirmed that B18Ag1 
represents an actual mRNA sequence present at relatively high levels in the 
30 patient's breast tumor tfesue. The primers used in amplification were B1 8Ag1 -1 
(CTG CCT GAG CCA CAAATG) (SEQ ID NO:128) and B18Ag1-4 (CCG GAG 
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GAG GAA GCT AGA GGA ATA) (SEQ ID N0.1 29) at a 3.5 mM magnesium 
concentration and a pH of 8.5, and B18Ag1-2 (ATG GCT ATT TTC GGG GCC 
TGA CA) (SEQ ID NO:1 26) and B1 8Ag1 -3 (CCG GTA TCT CCT CGT GGG 
TATT) (SEQ ID NO:127) at 2 mM magnesium at pH 9.5. The same 

5 experiments showed exceedingly low to nonexistent levels of expression in this 
patient's normal breast tissue (see Figure 1). RT-PCR experiments were then 
used to show that B1 8Ag1 mRNA is present in nine other breast tumor samples 
(from Brazilian and American patients) but absent in, or at exceedingly low 
levels in, the normal breast tissue corresponding to each cancer patient. RT- 

1 0 PCR analysis has also shown that the B1 8Ag1 transcript is not present in 
various normal tissues (including lymph node, myocardium and liver) and 
present at relatively low levels in PBMC and lung tissue. The presence of 
B18Ag1 mRNA in breast tumor samples, and its absence from normal breast 
tissue, has been confirmed by Northern blot analysis, as shown in Figure 2. 

1 5 The differential expression of B1 8Ag1 in breast tumor tissue was 

also confirmed by RNase protection assays. Figure 3 shows the level of 
B18Ag1 mRNA in various tissue types as determined in four different RNase 
protection assays. Lanes 1-12 represent various normal breast tissue samples, 
lanes 13-25 represent various breast tumor samples; lanes 26-27 represent 

20 normal prostate samples; lanes 28-29 represent prostate tumor samples; lanes 
30-32 represent colon tumor samples; lane 33 represents normal aorta; lane 34 
represents normal small intestine; lane 35 represents normal skin, lane 36 
represents normal lymph node; lane 37 represents normal ovary; lane 38 
represents normal liver; lane 39 represents normal skeletal muscle; lane 40 

25 represents a first normal stomach sample, lane 41 represents a second normal 
stomach sample; lane 42 represents a normal lung; lane 43 represents normal 
kidney; and lane 44 represents normal pancreas. Interexperimental 
comparison was facilitated by including a positive control RNA of known B-actin 
message abundance in each assay and normalizing the results of the different 

30 assays with respect to this positive control. 
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RT-PCR and Southern Blot analysis has shown the B18Ag1 locus 
to be present in human genomic DNA as a single copy endogenous retroviral 
element. A genomic clone of approximately 1 2-1 8 kb was isolated using the 
initial B18Ag1 sequence as a probe. Four additional subclones were also 
5 isolated by Xbal digestion. Additional retroviral sequences obtained from the 
ends of the Xbal digests of these clones (located as shown in Figure 4) are 
shown as SEQ ID NO:3 - SEQ ID NO:10, where SEQ ID NO:3 shows the 
location of the sequence labeled 10 in Figure 4, SEQ ID NO:4 shows the 
location of the sequence labeled 1 1-29, SEQ ID NO:5 shows the location of the 
1 0 sequence labeled 3, SEQ ID NO:6 shows the location of the sequence labeled 
6, SEQ ID NO:7 shows the location of the sequence labeled 12, SEQ ID NO:8 
shows the location of the sequence labeled 13, SEQ ID NO:9 shows the 
location of the sequence labeled 14 and SEQ ID NO:10 shows the location of 
the sequence labeled 1 1-22. 
1 5 Subsequent studies demonstrated that the 1 2-1 8 kb genomic 

clone contains a retroviral element of about 7.75 kb, as shown in Figures 5A 
and 5B. The sequence of this retroviral element is shown in SEQ ID NO:141 . 
The numbered line at the top of Figure 5A represents the sense strand 
sequence of the retroviral genomic clone. The box below this line shows the 
20 position of selected restriction sites. The arrows depict the different 

overlapping clones used to sequence the retroviral element. The direction of 
the arrow shows whether the single-pass subclone sequence corresponded to 
the sense or anti-sense strand. Figure 5B is a schematic diagram of the 
retroviral element containing B18Ag1 depicting the organization of viral genes 
25 within the element. The open boxes correspond to predicted reading frames, 
starting with a methionine, found throughout the element. Each of the six likely 
reading frames Is shown, as indicated to the left of the boxes, with frames 1 -3 
corresponding to those found on the sense strand. 

Using the cDNA of SEQ ID NO:1 as a probe, a longer cDNA was 
30 obtained (SEQ ID NO:227) which contains minor nucleotide differences (less 
than 1%) compared to the genomic sequence shown in SEQ ID NO:141 . 
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B. Preparation of cDNA Molecules Encoding Other Breast Tumor-Specific 
Polypeptides 

Normal RNA and tumor RNA was prepared and mRNA was 
isolated and converted into cDNA using a (dT)i 2 AG anchored 3' primer, as 
5 described above. Differential display PCR was then executed using the 

randomly chosen primers of SEQ ID NO:87-1 25. Amplification conditions were 
as noted above, and bands observed to be specific to the RNA fingerprint 
pattern of the tumor were cut out of a silver stained gel, subcloned into either 
the T-vector (Novagen, Madison, Wl) or the pCRII vector (Invitrogen, San 

10 Diego, CA) and sequenced. The sequences are provided in SEQ ID NO:1 1 - 
SEQ ID NO:86. Of the 79 sequences isolated, 67 were found to be novel (SEQ 
ID NO:1 1-26 and 28-77) (see also Figures 6-20). 

An extended DNA sequence (SEQ ID NO:290) for the antigen 
B15Ag1 (originally identified partial sequence provided in SEQ ID NO:27) was 

1 5 obtained in further studies. Comparison of the sequence of SEQ ID NO:290 
with those in the gene bank as described above, revealed homology to the 
known human p-A activin gene. Further studies led to the isolation of the full- 
length cDNA sequence for the antigen B21GT2 (also referred to as B31 1 D; 
originally identified partial cDNA sequence provided in SEQ ID NO:56). The 

20 full-length sequence is provided in SEQ ID NO:307, with the corresponding 
amino acid sequence being provided in SEQ ID NO:308. Further studies led to 
the isolation of a splice variant of B31 1 D. The B311D clone of SEQ ID NO:31 6 
was sequenced and a Xhol/Notl fragment from this clone was gel purified and 
32P-cDTP labeled by random priming for use as a probe for further screening 

25 to obtain additional B31 1 D gene sequence. Two fractions of a human breast 
tumor cDNA bacterial library were screened using standard techniques. One of 
the clones isolated in this manner yielded additional sequence which includes a 
poly A+ tail. The determined cDNA sequence of this clone (referred to as 
B311D_BT1_1A) is provided in SEQ ID NO:317. The sequences of SEQ ID 

30 NO:31 6 and 31 7 were found to share identity over a 464 bp region, with the 
sequences diverging near the poly A+ sequence of SEQ ID NO:317. 
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Subsequent studies identified an additional 1 46 sequences (SEQ 
ID NO:142-289), of which 115 appeared to be novel (SEQ ID NO:142, 143, 
146-152, 154-166, 168-176, 178-192, 194-198, 200-204, 206, 207, 209-214, 
216, 218, 219, 221-240, 243-245, 247, 250, 251, 253, 255, 257-266, 268, 269, 
5 271-273, 275, 276, 278, 280, 281 , 284, 288 and 291 ). To the best of the 
inventors' knowledge none of the previously identified sequences have 
heretofore been shown to be expressed at a greater level in human breast 
tumor tissue than in normal breast tissue. 

In further studies, several different splice forms of the antigen 
1 0 B1 1 Ag1 (also referred to as B305D) were isolated, with each of the various 
splice forms containing slightly different versions of the B1 1 Ag1 coding frame. 
Splice junction sequences define individual exons which, in various patterns 
and arrangements, make up the various splice forms. Primers were designed 
to examine the expression pattern of each of the exons using RT-PCR as 
1 5 described below. Each exon was found to show the same expression pattern 
as the original B1 1 Ag1 clone, with expression being breast tumor-, normal 
prostate- and normal testis-specific. The determined cDNA sequences for the 
isolated protein coding exons are provided in SEQ ID NO:292-298, 
respectively. The predicted amino acid sequences corresponding to the 
20 sequences of SEQ ID NO:292 and 298 are provided in SEQ ID NO:299 and 
300. Additional studies using rapid amplification of cDNA ends (RACE), a 5' 
specific primer to one of the splice forms of B1 1 Ag1 provided above and a 
breast adenocarcinoma, led to the isolation of three additional, related, splice 
forms referred to as isoforms B1 1 C-1 5, B1 1 C-8 and B1 1 C-9,1 6. The 
25 determined cDNA sequences for these isoforms are provided in SEQ ID NO: 
301-303, with the corresponding predicted amino acid sequences being 
provided in SEQ ID NO:304-306. 

The protein coding region of B1 1C-15 (SEQ ID NO: 301 ; also 
referred to as B305D isoform C) was used as a query sequence in a BLASTN 
30 search of the Genbank DNA database. A match was found to a genomic clone 
from chromosome 21 (Accessson no. AP001465). The pairwise alignments 
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provided in the BLASTN output were used to identify the putative exon, or 
coding, sequence of the chromosome 21 sequence that corresponds to the 
B305D sequence. Based on the BlastN pairwise alignments, the following 
pieces of GenBank record AP001465 were put together: base pairs 67978- 
5 68499, 72870-72987, 73144-73335, 76085-76206, 77905-78085, 80520- 
80624, 87602-87633. This sequence was then aligned with the B305D isoform 
C sequence using the DNA Star Seqman program and excess sequence was 
deleted in such a way as to maintain the sequence most similar to B305D. The 
final edited form of the chromosome 21 sequence was 96.5% identical to 

1 0 B305D. This resulting edited sequence from chromosome 21 was then 

translated and found to contain no stop codons other than the final stop codon 
in the same position as that for B305D. As with B305D, the chromosome 21 
sequence (provided in SEQ ID NO: 325) encoded a protein (SEQ ID NO: 326) 
with 384 amino acids. An alignment of this protein with the B305D isoform C 

15 protein (SEQ ID NO: 304) showed 90% amino acid identity. 

The cDNA sequence of B305D isoform C (SEQ ID NO: 301) was 
used to identify homologs by searching the High Throughput Genome 
Sequencing (HTGS) database (NCBI, National Institutes for Health, Bethesda, 
MD). Homologs were identified on Chromosome 2 (Clone ID 9838181), 

20 Chromosome 1 0 (Clone ID 1 0933022), Chromosome 15 (Clone ID 1 1560284). 
These homologs shared greater than 90% identity with B305D isoform C at the 
nucleic acid level. All three of these homologs encode 384 amino acid ORFs 
that share greater than 90% identity with the amino acid sequence of SEQ ID 
NO: 304. Further searching of the GenBank database with the sequence of 

25 SEQ ID NO: 301 yielded a partial sequence homolog on Chromosome 22 
(Clone ID 5931 507). cDNA sequences for the Chromosome 2, 1 0, 1 5 and 22 
homologs were constructed based on the homology with B305D isoform C and 
the conserved sequences at intron-exon junctions. The cDNA sequences for 
the Chromosome 22, 2, 15 and 10 homologs are provided in SEQ ID NO: 327- 

30 330, respectively, with the corresponding amino acid sequences being provided 
in SEQ ID NO: 331, 334, 333 and 332, respectively. 
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In subsequent studies on B305D Isoform A (cDNA sequence 
provided in SEQ ID NO.292), the cDNA sequence (provided in SEQ ID 
NO:313) was found to contain an additional guanine residue at position 884, 
leading to a frameshift in the open reading frame. The determined DNA 
5 sequence of this ORF is provided in SEQ ID NO:31 4. This frameshift 

generates a protein sequence (provided in SEQ ID NO:315) of 293 amino acids 
that contains the C-terminal domain common to the other isoforms of B305D 
but that differs in the N-terminal region. 

EXAMPLE 2 

1 o Preparation of B1 8Ag1 DNA From Human Genomic DNA 

This Example illustrates the preparation of B18Ag1 DNA by 
amplification from human genomic DNA. 

B18Ag1 DNA may be prepared from 250 ng human genomic DNA 
using 20 pmol of B1 8Ag1 specific primers, 500 pmol dNTPS and 1 unit of Taq 
1 5 DNA polymerase (Perkin Elmer, Branchburg, NJ) using the following 

amplification parameters: 94°Cfor 30 seconds denaturing, 30 seconds 60°C to 
42°C touchdown annealing in 2°C increments every two cycles and 72°C 
extension for 30 seconds. The last increment (a 42°C annealing temperature) 
should cycle 25 times. Primers were selected using computer analysis. 
20 Primers synthesized were B1 8Ag1 -1 , B1 8Ag1 -2, B1 8Ag1 -3, and B1 8Ag1 -4. 
Primer pairs that may be used are 1+3, 1+4, 2+3, and 2+4. 

Following gel electrophoresis, the band corresponding to B18Ag1 
DNA may be excised and cloned into a suitable vector. 

EXAMPLE 3 

25 Preparation of B18AG1 DNA From Breast Tumor cDNA 

This Example illustrates the preparation of B18Ag1 DNA by 
amplification from human breast tumor cDNA. 

First strand cDNA is synthesized from RNA prepared from human 
breast tumor tissue in a reaction mixture containing 500 ng poly A+ RNA, 200 
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pmol of the primer (T) 12 AG (/.e., TTT TTT TTT TTT AG) (SEQ ID NO:1 30), 1X 
first strand reverse transcriptase buffer, 6.7 mM DTT, 500 mmol dNTPs, and 1 
unit AMV or MMLV reverse transcriptase (from any supplier, such as Gibco- 
BRL (Grand Island, NY)) in a final volume of 30 [d. After first strand synthesis, 
5 the cDNA is diluted approximately 25 fold and 1 |xl is used for amplification as 
described in Example 2. While some primer pairs can result in a 
heterogeneous population of transcripts, the primers B18Ag1-2 (5'ATG GCT 
ATTTTC GGG GGCTGA CA) (SEQ ID NO:126) and B18Ag1-3 (5'CCG GTA 
TCT CCT CGT GGG TAT T) (SEQ ID NO:127) yield a single 151 bp 
10 amplification product. 

EXAMPLE 4 

Identification of B-cell and T-cell Epitopes of B1 8Ag1 

This Example illustrates the identification of B18Ag1 epitopes. 
The B18Ag1 sequence can be screened using a variety of 

1 5 computer algorithms. To determine B-cell epitopes, the sequence can be 
screened for hydrophobicity and hydrophilicity values using the method of 
Hopp, Prog. Clin. Biol. Res. 1 728:367-77 (1985) or, alternatively, Cease et al., 
J. Exp. Med. 164:1779-84 (1986) or Spouge et al., J. Immunol. 138:204-1 2 
(1987). Additional Class II MHC (antibody or B-cell) epitopes can be predicted 

20 using programs such as AMPHI (e.g., Margalit et al., J. Immunol. 138:2213 
(1987)) or the methods of Rothbard and Taylor {e.g., EMBO J. 7:93 (1988)). 

Once peptides (15-20 amino acids long) are identified using these 
techniques, individual peptides can be synthesized using automated peptide 
synthesis equipment (available from manufacturers such as Perttn 

25 Elmer/Applied Biosystems Division, Foster City, CA) and techniques such as 
Merrifield synthesis. Following synthesis, the peptides can used to screen sera 
harvested from either normal or breast cancer patients to determine whether 
patients with breast cancer possess antibodies reactive with the peptides. 
Presence of such antibodies in breast cancer patient would confirm the 

30 immunogenicity of the specific B-cell epitope in question. The peptides can 
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also be tested for their ability to generate a serologic or humoral immune in 
animals (mice, rats, rabbits, chimps etc.) following immunization in vivo. 
Generation of a peptide-specific antiserum following such immunization further 
confirms the immunogenicity of the specific B-cell epitope In question. 

5 To identify T-cell epitopes, the B1 8Ag1 sequence can be 

screened using different computer algorithms which are useful in identifying 8- 
10 amino acid motifs within the B18Ag1 sequence which are capable of binding 
to HLA Class I MHC molecules, (see, e.g., Rammensee et al., Immunogenetics 
47:178-228 (1995)). Following synthesis such peptides can be tested for their 

10 ability to bind to class I MHC using standard binding assays (e.g., Sette et al., 
J. Immunol. 753:5586-92 (1994)) and more importantly can be tested for their 
ability to generate antigen reactive cytotoxic T-cells following in vitro stimulation 
of patient or normal peripheral mononuclear cells using, for example, the 
methods of Bakker et al., Cancer Res. 55:5330-34 (1995); Visseren et al., J. 

15 Immunol. 754:3991-98 (1995); Kawakami et al., J. Immunol. 754:3961 -68 
(1995); and Kast et al., J. Immunol. 752:3904-12 (1994). Successful in vitro 
generation of T-cells capable of killing autologous (bearing the same Class I 
MHC molecules) tumor cells following in vitro peptide stimulation further 
confirms the immunogenicity of the B18Ag1 antigen. Furthermore, such 

20 peptides may be used to generate murine peptide and B1 8Ag1 reactive 

cytotoxic T-cells following in vivo immunization in mice rendered transgenic for 
expression of a particular human MHC Class I haplotype (vltiello et al., J. Exp. 

Med. 773:1007-15 (1991). 

A representative list of predicted B18Ag1 B-cell and T-cell 
25 epitopes, broken down according to predicted HLA Class I MHC binding 
antigen, is shown below: 

Predicted Th Motifs (B-cell epitopes) (SEQ ID NOS.: 131-133) 
SSGGRTFDDFHRYLLVGI 
QGAAQ KPI NLSKXI EWQGHDE 
30 SPGVFLEHLQEAYRIYTPFDLSA 
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Predicted HLA A2.1 Motifs fT-cell epitopes) (SEQ ID NOS.: 134-140) 
YLLVGIQGA 
GAAQKPINL 
NLSKXIEW 
5 EWQGHDES 
HLQEAYRIY 
NLAFVAQAA 
FVAQAAPDS 

EXAMPLES 

1 o Identification of T-cell Epitopes of B1 1 Ag1 

This Example illustrates the identification of B1 1 Ag1 (also 
referred to as B305D) epitopes. Four peptides, referred to as B1 1 -8, B1 1 -1 , 
B1 1 -5 and B1 1 -1 2 (SEQ I D NO:309-31 2, respectfully) were derived from the 
B11Ag1 gene. 

1 5 Human CD8 T cells were primed in vitro to the peptide B1 1 -8 

using dendritic cells according to the protocol of Van Tsai et al. (Critical 
Reviews in Immunology 78:65-75, 1998). The resulting CD8 T cell cultures 
were tested for their ability to recognize the B1 1-8 peptide or a negative control 
peptide, presented by the B-LCL line, JY. Briefly, T cells were incubated with 

20 autologous monocytes in the presence of 1 0 ug/ml peptide, 1 0 ng/ml IL-7 and 
1 0 ug/ml IL-2, and assayed for their ability to specifically lyse target cells in a 
standard 51 -Cr release assay. As shown in Fig. 22, the bulk culture line 
demonstrated strong recognition of the B1 1 -8 peptide with weaker recognition 
of the peptide B11-1. 

25 A clone from this CTL line was isolated following rapid expansion 

using the monoclonal antibody OKT3 and human IL-2. As shown in Fig. 23, 
this clone (referred to as A1), in addition to being able to recognize specific 
peptide, recognized JY LCL transduced with the B1 1 Ag1 gene. This data 
demonstrates that B1 1-8 is a naturally processed epitope of the B1 1 Ag1 gene. 

30 In addition these T cells were further found to recognize and lyse, in an HLA-A2 
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restricted manner, an established tumor cell line naturally expressing B1 1 Agl 
(Fig. 24). The T cells strongly recognize a lung adenocarcinoma (LT-1 40-22) 
naturally expressing B1 1 Ag1 transduced with HLA-A2, as well as an A2+ breast 
carcinoma (CAMA-1) transduced with B1 1 Ag1 , but not untransduced lines or 
5 another negative tumor line (SW620). 

These data clearly demonstrate that these human T cells 
recognize not only B1 1 -specific peptides but also transduced cells, as well as 
naturally expressing tumor lines. 

CTL lines raised against the antigens B1 1 -5 and B1 1 -1 2, using 
1 0 the procedures described above, were found to recognize corresponding 
peptide-coated targets. 

EXAMPLE 6 

Characterization of Breast Tumor Genes Discovered By 
Differential Display PCR 

1 5 The specificity and sensitivity of the breast tumor genes 

discovered by differential display PCR were determined using RT-PCR. This 
procedure enabled the rapid evaluation of breast tumor gene mRNA expression 
semiquantitatively without using large amounts of RNA Using gene specific 
primers, mRNA expression levels in a variety of tissues were examined, 

20 including 8 breast tumors, 5 normal breasts, 2 prostate tumors, 2 colon tumors, 
1 Jung tumor, and 14 other normal adult human tissues, including normal 
prostate, colon, kidney, liver, lung, ovary, pancreas, skeletal muscle, skin, 
stomach and testes. 

To ensure the semiquantitative nature of the RT-PCR, p-actin 

25 was used as internal control for each of the tissues examined. Serial dilutions 
of the first strand cDNAs were prepared and RT-PCR assays performed using 
p-actin specific primers. A dilution was then selected that enabled the linear 
range amplification of p-actin template, and which was sensitive enough to 
reflect the difference in the initial copy number. Using this condition, the p-actin 

30 levels were determined for each reverse transcription reaction from each 
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tissue. DNA contamination was minimized by DNase treatment and by 
assuring a negative result when using first strand cDNA that was prepared 
without adding reverse transcriptase. 

Using gene specific primers, the mRNA expression levels were 

5 determined in a variety of tissues. To date, 38 genes have been successfully 
examined by RT-PCR, five of which exhibit good specificity and sensitivity for 
breast tumors (B15AG-1, B31GA1 b, B38GA2a, B11A1aand B18AG1a). 
Figures 21 A and 21 B depict the results for three of these genes: B15AG-1 
(SEQ ID NO:27), B31GA1b (SEQ ID NO:148) and B38GA2a (SEQ ID NO:157). 

1 0 Table 2 summarizes the expression level of all the genes tested in normal 
breast tissue and breast tumors, and also in other tissues. 



15 



20 



Table 2 

Percentage of Breast Cancer Antigens That Are Expresse d in Various 

Tissues 



Breast Tissues 



Over-expressed in Breast Tumors 84% 
Equally Expressed in Normals and Tumor 1 6% 





Over-expressed in Breast Tumors but 


9% 




not in any Normal Tissues 


Other Tissues 


Over-expressed in Breast Tumors but 






Expressed in Some Normal Tissues 


30% 




Over-expressed in Breast Tumors but 


61% 




Equally Expressed in All Other Tissues 



30 
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EXAMPLE 7 

Preparation and Characterization of Antibodies Against 
Breast Tumor Polypeptides 

Polyclonal antibodies against the breast tumor antigen B305D 

5 were prepared as follows. 

The breast tumor antigen expressed in an E. coli recombinant 
expression system was grown overnight in LB broth with the appropriate 
antibiotics at 37°C in a shaking incubator. The next morning, 1 0 ml of the 
overnight culture was added to 500 ml to 2x YT plus appropriate antibiotics in a 
1 0 2L-baffled Erlenmeyer flask. When the Optical Density (at 560 nm) of the 
culture reached 0.4-0.6, the cells were induced with IPTG (1 mM). Four hours 
after induction with IPTG, the cells were harvested by centrifugation. The cells 
were then washed with phosphate buffered saline and centrifuged again. The 
supernatant was discarded and the cells were either frozen for future use or 

15 immediately processed. Twenty ml of lysis buffer was added to the cell pellets 
and vortexed. To break open the E. coli cells, this mixture was then run 
through the French Press at a pressure of 16,000 psi. The cells were then 
centrifuged again and the supernatant and pellet were checked by SDS-PAGE 
for the partitioning of the recombinant protein. For proteins that localized to the 

20 cell pellet, the pellet was resuspended in 1 0 mM Tris pH 8.0, 1 % CHAPS and 
the inclusion body pellet was washed and centrifuged again. This procedure 
was repeated twice more. The washed inclusion body pellet was solubilized 
with either 8 M urea or 6 M guanidine HCI containing 10 mM Tris pH 8.0 plus 
10 mM imidazole. The solubilized protein was added to 5 ml of nickel-chelate 

25 resin (Qiagen) and incubated for 45 min to 1 hour at room temperature with 
continuous agitation. After incubation, the resin and protein mixture were 
poured through a disposable column and the flow through was collected. The 
column was then washed with 10-20 column volumes of the solubilization 
buffer. The antigen was then eluted from the column using 8M urea, 1 0 mM 

30 Tris pH 8.0 and 300 mM imidazole and collected in 3 ml fractions. A SDS- 
PAGE gel was run to determine which fractions to pool for further purification. 
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As a final purification step, a strong anion exchange resin such as 
HiPrepQ (Biorad) was equilibrated with the appropriate buffer and the pooled 
fractions from above were loaded onto the column. Antigen was eluted off the 
column with a increasing salt gradient. Fractions were collected as the column 
5 was run and another SDS-PAGE gel was run to determine which fractions from 
the column to pool. The pooled fractions were dialyzed against 10 mM Tris pH 
8.0. The protein was then vialed after filtration through a 0.22 micron filter and 
the antigens were frozen until needed for immunization. 

Four hundred micrograms of B305D antigen was combined with 

10 1 00 micrograms of muramyldipeptide (MDP) . Every four weeks rabbits were 
boosted with 100 micrograms mixed with an equal volume of Incomplete 
Freund's Adjuvant (IFA). Seven days following each boost, the animal was 
bled. Sera was generated by incubating the blood at 4 °C for 1 2-24 hours 
followed by centrifugation. 

15 Ninety-six well plates were coated with B305D antigen by 

incubating with 50 microliters (typically 1 microgram) of recombinant protein at 
4 °C for 20 hours. 250 microliters of BSA blocking buffer was added to the 
wells and incubated at room temperature for 2 hours. Plates were washed 6 
times with PBS/0.01 % Tween. Rabbit sera was diluted in PBS. Fifty microliters 

20 of diluted sera was added to each well and incubated at room temperature for 
30 min. Plates were washed as described above before 50 microliters of goat 
anti-rabbit horse radish peroxidase (HRP) at a 1 :10000 dilution was added and 
incubated at room temperature for 30 min. Plates were again washed as 
described above and 100 microliters of TMB microwell peroxidase substrate 

25 was added to each well. Following a 1 5 min incubation in the dark at room 
temperature, the colorimetric reaction was stopped with 100 microliters of 1N 
H 2 S0 4 and read immediately at 450 nm. The polyclonal antibodies showed 
immunoreactivity to B305D. 

Immunohistochemical (IHC) analysis of B305D expression in 

30 breast cancer and normal breast specimens was performed as follows. 
Paraffin-embedded formal fixed tissue was sliced into 8 micron sections. 
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Steam heat induced epitope retrieval (SHIER) in 0.1 M sodium citrate buffer 
(pH 6.0) was used for optimal staining conditions. Sections were incubated 
with 10% serum/PBS for 5 minutes. Primary antibody was added to each 
section for 25 min at indicated concentrations followed by a 25 min incubation 

5 with either an anti-rabbit or anti-mouse biotinylated antibody. Endogenous 
peroxidase activity was blocked by three 1.5 min incubations with hydrogen 
peroxide. The avidin biotin complex/horseradish peroxidase (ABC/HRP) 
systems was used along with DAB chromagen to visualize antigen expression. 
Slides were counterstained with hematoxylin. B305D expression was detected 

10 in both breast tumor and normal breast tissue. However, the intensity of 
staining was much less in normal samples than in tumor samples and surface 
expression of B305D was observed only in breast tumor tissues. 

A summary of real-time PCR and immunohistochemical analysis 
of B305D expression in an extensive panel of normal tissues is presented in 

15 Table 3 below. These results demonstrate minimal expression of B305D in 
testis, inconclusive results in gall bladder, and no detection in all other tissues 
tested. 



Table 3 



mRNA 


IHC staining 


Tissue type 


Summary 


Moderately 
positive 


Positive 


Testis 


Nuclear staining of small 
minority of spermatids; 
spermatozoa negative; 
siminoma negative 


Negative 


Negative 


Thymus 


No expression 


N/A 


Negative 


Artery 


No expression 


Negative 


Negative 


Skeletal muscle 


No expression 


Negative 


Positive (weak 
staining) 


Small bowel 


No expression 


Negative 


Positive (weak 
staining) 


Ovary 


No expression 


Neqatlve 




Pituitary 


No expression 


Negative 


Positive (weak 
staining) 


Stomach 


No expression 


Negative 


Negative 


Spinal cord 


No expression 


Negative 


Negative 


Spleen 


No expression 
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mRNA 


IHC staining 


Tissue type 


Summary 


Neqative 


Negative 


Ureter 


No expression 


N/A 


Negative 


Gall bladder 


Inconclusive 


N/A 


Negative 


Placenta 


No expression 


Negative 


Negative 


Thyroid 


No expression 


Negative 


Negative 


Heart 


No expression 


Neqative 


Negative 


Kidney 


No expression 


Negative 


Negative 


Liver 


No expression 


Negative 


Negative 


Brain- 
cerebellum 


No expression 


Neqative 


Negative 


Colon 


No expression 


Negative 


Negative 


Skin 


No expression 


Negative 


Negative 


Bone marrow 


No expression 


£2 

N/A 


Negative 


Parathyroid 


No expression 


Negative 


Negative 


Lung 


No expression 


Negative 


Negative 


Esophagus 


No expression 


Negative 


Positive (weak 
staining) 


Uterus 


No expression 


Negative 


Negative 


Adrenal 


No expression 


Negative 


Negative 


Pancreas 


No expression 


N/A 


Negative 


Lymph node 


No expression 


Negative 


Negative 


Brain-cortex 


No expression 


N/A 


Negative 


Fallopian tube 


No expression 


Negative 


Positive (weak 
staining) 


Bladder 


No expression 


Negative 


N/A 


Bone 


No expression 


Negative 


N/A 


Salivary gland 


No expression 


Negative 


N/A 


Activated PBMC 


No expression 


Negative 


N/A 


Resting PBMC 


No expression 


Negative 


N/A 


Trachea 


No expression 


Negative 


N/A 


Vena cava 


No expression 


Negative 


N/A 


Retina 


No expression 


Negative 


N/A 


Cartilage 


No expression 



EXAMPLE 8 
Protein Expression of Breast Tumor Antigens 

This example describes the expression and purification of the 
breast tumor antigen B305D in £ coli and in mammalian cells. 

Expression of B305D isoform C-15 (SEQ ID NO:301 ; translated to 
384 amino acids) in £. coli was achieved by cloning the open reading frame of 
B305D isoform C-15 downstream of the first 30 amino acids of the M. 
tuberculosis antigen Ra12 (SEQ ID NO:318) in pET17b. First, the internal 
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EcoRI site in the B305D ORF was mutated without changing the protein 
sequence so that the gene could be cloned at the EcoRI site with Ra12. The 
PCR primers used for site-directed mutagenesis are shown in SEQ ID NO:319 
(referred to as AW012) and SEQ ID NO:320 (referred to as AW013). The ORF 
5 of EcoRI site-modified B305D was then amplified by PCR using the primers 
AW014 (SEQ ID NO:321) and AW015 (SEQ ID NO:322). The PCR product 
was digested with EcoRI and ligated to the Ra12/pET17b vector at the EcoRI 
site. The sequence of the resulting fusion construct (referred to as 
Ra12mB1 1C) was confirmed by DNA sequencing. The determined cDNA 
1 0 sequence for the fusion construct is provided in SEQ ID NO:323, with the 
amino acid sequence being provided in SEQ ID NO:324. 

The fusion construct was transformed into BL21 (DE3)CodonPlus- 
RIL E. coli (Stratagene) and grown overnight in LB broth with kanamycin. The 
resulting culture was induced with IPTG. Protein was transferred to PVDF 
1 5 membrane and blocked with 5% non-fat milk (in PBS-Tween buffer), washed 
three times and incubated with mouse anti-His tag antibody (Clontech) for 1 
hour. The membrane was washed 3 times and probed with HRP-Protein A 
(Zymed) for 30 min. Finally, the membrane was washed 3 times and 
developed with ECL (Amersham). Expression was detected by Western blot. 
20 For recombinant expression in mammalian cells, B305D isoform 

C-15 (SEQ ID NO:301; translated to 384 amino acids) was subcloned into the 
mammalian expression vectors pCEP4 and pcDNA3.1 (Invitrogen). These 
constructs were transfected into HEK293 cells (ATCC) using Fugene 6 reagent 
(Roche). Briefly, the HEK cells were plated at a density of 100,000 cells/ml in 
25 DMEM (Gibco) containing 10% FBS (Hyclone) and grown overnight. The 
following day, 2 ul of Fugene 6 was added to 100 ul of DMEM containing no 
FBS and incubated for 1 5 minutes at room temperature. The Fugene 6/DMEM 
mixture was added to 1 ug of B305D/pCEP4 or B305D/pcDNA plasmid DNA 
and incubated for 1 5 minutes at room temperature. The Fugene/DNA mix was 
30 then added to the HEK293 cells and incubated for 48-72 hours at 37 °C with 
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7% C0 2 . Cells were rinsed with PBS, the collected and pelleted by 
centrif ligation. 

For Western blot analysis, whole cell lysates were generated by 
incubating the cells in Triton-X1 00 containing lysis buffer for 30 minutes on ice. 

5 Lysates were then cleared by centrifugation at 1 0,000 rpm for 5 minutes at 4 
°C. Samples were diluted with SDS-PAGE loading buffer containing beta- 
mercaptoethanol, and boiled for 1 0 minutes prior to loading the SDS-PAGE gel. 
Proteins were transferred to nitrocellulose and probed using Protein A purified 
anti-B305D rabbit polyclonal sera (prepared as described above) at a 

1 0 concentration of 1 ug/ml. The blot was revealed with a goat anti-rabbit Ig 

coupled to HRP followed by incubation in ECL substrate. Expression of B305D 
was detected in the HEK293 lysates transfected with B305D, but not in control 
HEK293 cells transfected with vector alone. 

For FACS analysis, cells were washed further with ice cold 

1 5 staining buffer and then incubated with a 1 :1 00 dilution of a goat anti-rabbit Ig 
(H+L)-FITC reagent (Southern Biotechnology) for 30 minutes on ice. Following 
3 washes, the cells were resuspended in staining buffer containing Propidium 
Iodide (PI), a vital stain that allows for identification of permeable cells, and 
then analyzed by FACS. The FACS analysis showed surface expression of 

20 B305D protein. 

EXAMPLE 9 

Expression of Full-Length B305D In Insect Cells 
Using a Baculovirus Expression System 

The cDNA for the full-length breast tumor antigen, B305D isoform 
25 C (SEQ ID NO:301 ), with a C-terminal His Tag was made by PCR using 
B1 1C15/pBib as a template and the following primers: 
B305DF1 (SEQ ID NO:337): 

5'CGGCGGATCCACCATGGTGGTTGAGGTTGATTCC 
B305DRV1 (SEQ ID NO:338): 
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5'CGGCTCTAGATTAATGGTGATGGTGATGATGATGGTGATG 
ATGTTTATTTCTGGTTCTTGAGACATTTTCTGGA. 

The PCR product with the expected size was recovered from an 
agarose gel, digested with the Bam HI and Xba I restriction enzymes, and 
ligated into the transfer plasmid pFastBad which was digested with the same 
restriction enzymes. The sequence of the insert was confirmed by DNA 
sequencing and is set forth in SEQ ID NO:335. The predicted amino acid 
sequence of B305D with the C-terminal His tag is set forth in SEQ ID NO:336. 
The recombinant transfer plasmid pFBB305D was used to make recombinant 
bacmid DNA and virus by the Bac-To-Bac baculovirus expression system 
(Invttrogen Life Technologies, Carlsbad, CA). The recombinant BVB305D virus 
was amplified in Sf9 insect cells and used to infect High Five insect cells. 
Infected cells were harvested at 24-30 hours post-infection. The identity of the 
recombinant protein was confirmed by Western blot with a rabbit polyclonal 
antibody against B305D. Recombinant protein was further analyzed by SDS- 
PAGE followed by Coomassie blue staining. 

EXAMPLE 10 

Identification of an Additional B305D Homolog Discovered 

BY BlOINFORMATIC SEARCH 

20 The High Throughput Genome Sequencing (HTGS) database 

was searched with the B305D C form sequence (SEQ ID NO:301) and 
revealed another highly related copy of the B305D gene, tentatively localized to 
Chromosome 14. The sequences identified were spliced together based on 
the B305D C form sequence and exon-intron splice sites. This predicted cDNA 
25 sequence (SEQ ID NO:339) was translated to generate the predicted amino 
acid sequence (SEQ ID NO:340). The B305D gene family members have been 
shown to be overexpressed in breast cancer, prostate cancer, and ovarian 
cancer. 



124 



10 



wnn^ni343l fhttp ://www. q etth e p a tenU^ 



WO 03/013431 PCT/US02/24917 

EXAMPLE 11 

IMMUNOHISTOCHEMICAL (IHC) ANALYSIS OF B305D EXPRESSION 

Analysis suggests that B305D is a type II plasma membrane 
protein of about 43 kDa with 1 predicted trasmembrane spanning domain. 
5 There are no glycosylation sites and its function remains unknown. Disclosed 
herein is further examination of B305D expression by immunohistochemistry 
(IHC) analysis in a variety of tumor and normal tissues. 

Methods and Materials: 

In order to determine which tissues express the breast cancer 

1 0 antigen B305D, IHC analysis was performed on a diverse range of tissue 
sections. Tissue samples were fixed in formalin solution for 12-24 hours and 
embedded in paraffin before being sliced into 8 micron sections. Steam heat 
induced epitope retrieval (SHIER) in 0.1 M sodium citrate buffer (pH 6.0) was 
used for optimal staining conditions. Sections were incubated with 1 0% 

15 serum/PBS for 5 minutes. Primary antibody was added to each section for 25 
minutes at indicated concentrations followed by 25 minute incubation with anti- 
rabbit biotinylated antibody. Endogenous peroxidase activity was blocked by 
three 1 .5 minute incubations with hydrogen peroxidase. The avidin biotin 
complex/horse radish peroxidase (ABC/HRP) system was used along with DAB 

20 chromogen to visualize antigen expression. Slides were counterstained with 
hematoxylin to visualize cell nuclei. 

Rabbit polyclonal antibodies against B305D were shown in 
Example 7 to react in formalin fixed, paraffin-embedded tissues. The antibody 
was shown to label the plasma membrane of a subset of breast carcinomas. 

25 B305D was shown to label tissues that were positive for cerb-2, also called 
Her-2/neu. HER-2/neu (p185) is the protein product of the HER-2/neu 
oncogene. The HER-2/neu gene is amplified and the HER-2/neu protein is 
overexpressed in a variety of cancers including breast, ovarian, colon, lung, 
prostate and hematological cancers. HER-2/neu is related to malignant 

30 transformation and is found in 50%-60% of ductal in situ carcinoma and 20%- 
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40% of all breast cancers, as well as a substantial fraction of adenocarcinomas 
arising in the ovaries, prostate, colon and lung. HER-2/neu is intimately 
associated not only with the malignant phenotype, but also with the 
aggressiveness of the malignancy, being found in one-fourth of all invasive 
5 breast cancers. HER-2/neu overexpression is correlated with a poor prognosis 
in both breast and ovarian cancer. In this study breast carcinomas were tested 
from two age groups; women under 50 at the time of tumor removal and 
women over 50 at the time of tumor removal. B305D staining was evaluated 
for each. In addition to breast carcinomas ovarian carcinomas, normal 
1 0 pancreas, normal kidney and normal stomach were tested for B305D reactivity. 

Formalin-fixed, paraffin-embedded breast carcinomas from 23 
different patients were tested for B305D reactivity. The age of the patient at 
the time of tumor removal was available in all cases to determine whether 
patient age is associated with B305D staining. In many cases, estrogen 
1 5 receptor/progesterone receptor (ER/PR) data and cerb2 data was available 
from the pathology reports. Breast patients were chosen simply based on age. 
These patients in the Vounger' group are close to the age of 40. We also 
obtained tumors from patients that were closer to the age of 70. This group is 
referred to as the 'older' group. 
20 In addition to breast carcinomas, 1 7 different ovarian carcinomas 

were immunohistochemically analyzed for B305D staining. Five samples each 
of normal stomach, kidney and pancreas were also tested. For most of the 
tissues, the B305D antibody was tested with two different detection systems, 
one with ABC as the Horseradish Peroxidase (HRP) enzyme-linked reagent 
25 and another with strept-avidin as the HRP reagent. In all cases, rabbit IgG was 
run as a negative control in parallel with the B305D antibody. B305D was 
tested at 2.5 ng/ml using SHIER II heat pretreatment. Breast carcinoma multi- 
tissue block, QMTB21 , was used as a positive control for the antibody. Tumor 
#5 in the block was previously shown to label with a membrane pattern with the 
30 B305D antibody. 
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Results: Breast Carcinomas (results shown in Table 4) 

The avidin-biotin complex (ABC) stained slides were lighter than 
expected, although membrane staining was detected in the positive control. To 
make sure that no positive staining was overlooked, the slides were tested with 

5 the strept-avidin (SA) detection. Upon the analysis of the ABC slides, only one 
tumor labeled with a membrane pattern. This tumor was from a 42 yr old 
patient who also demonstrated membrane staining for cerb2. When retested 
with SA, an older patient that was cerb2 membrane positive was included. This 
tumor was from an 80 yr old patient. Breast cancer staining results are outlined 

10 in Table 4 below. The staining data presented in tables 4-6 is from the SA- 
HRP staining. The B305D antibody labels breast carcinomas in the cytoplasm 
and on the plasma membrane. Membrane staining is limited to tumor cells, 
whereas cytoplasmic staining is also often present in the normal ductal 
epithelium. Among the SA labeled tissues, only the positive control and the 42 

1 5 yr old and the 80 yr old that were cerb2 positive labeled membrane positive for 
B305D. Two other cases labeled with light membrane staining in a minority of 
tumor cells. One case was from a 28 yr old patient, the other from a 73 yr old 
patient; cerb2 status was not available for either of these cases. The limited 
staining in these two cases with lighter staining may be due to tissue fixation as 

20 positive cells were found on the periphery of the tissue. 

Thus, 4 cases of 23 (less than 20%) labeled with a membrane 
pattern for B305D. Less than 10% of the tumors (2 of 23) labeled with 
definitive membrane staining. In a previous random study, 3 of 15 cases 
demonstrated membrane staining for B305D. Cerb2 data was not available for 

25 all of the tissues tested but for the two cases that were definitively positive for 
B305D, both were strongly positive for cerb2. B305D membrane positive cases 
were split evenly across the younger' and 'older* groups. The younger group 
included 1 1 patients under 50 and the older group included 12 patients 50 or 
older. Of this older group, 9 of the patients are 66 or older, and 7 were in their 

30 70's and 80's (one tumor from a 50 year old had only a small amount of tumor 
in the block and may be discounted - thus 4 of 22 positive). ER/PR data was 
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available for most cases but no association with B305D could be determined. 
Thus, based on this and previous IHC data, B305D expression is closely 
associated with cerb2 expression. Further B305D testing of cerb2 positive 
breast tumors may strengthen this correlation. From the results of this study, 
5 patient age at the time of tumor removal does not appear to correlate with 
B305D staining. 

Table 4 

AGE RELATED B305D REACTIVITY IN BREAST CARCINOMAS 



Accession 
No. 


Age 


□OwvU inv 

Reactivity 


Diaanosis 


ERVPR Status 


S86-2763 
(slide 1) 




PufAnhcmir^ crtalninn 
Oylupiaol 1 llu olaii in iy 


Infiltratina Ductal 


ER/PR negative 


S00-9327 
(slide 2) 


28 


Marginal membrane 
siaining 


Infiltrating 
Lobular 


N/A 


S00-4786 
(slide 3) 


43 


Light cytoplasmic 
staining 


Infiltrating Mixed 
Ductal/Lobular 


ER positive 2-3+ 
PR positive 2-3+ 
Cerb2 Negative 1 + 


S86-1877 
(slide 4) 


40 


Cytoplasmic staining 


Infiltrating Ductal 


ER positive 
PR strongly 
positive 


S84-2015 
(slide 5) 


40 


Light cytoplasmic 
staining 


Infiltrating Ductal 


N/A 


S88-1981 
(slide 6) 


40 


Cytoplasmic staining 


Infiltrating Ductal 


N/A 


S84-2915 
(slide 7) 


38 


Light cytoplasmic 
staining 


Infiltrating Ductal 


ER strongly 
positive 
PR positive 


S86-1510 
(slide 8) 


41 




Infiltrating Ductal 


ER positive 
PR strongly 
positive 


S01-31 
(slide 9) 


42 


Membrane staining; 
cytoplasmic staining 


Infiltrating Ductal 


Cerb2 positive 3+ 


S84-855 
(slide 10) 


48 


Light cytoplasmic 
staining 


Infiltrating ducal 


ER Positive 
PR strongly 
positive 


00-1826 
(slide 50) 


46 


Light cytoplasmic 
staining 


Infiltrating ducal 


ER-positive 3+ 
PR-positive 3+ 
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Accession 
No. 


Age 


RlftRn IMP 
DOUOU Inv 

Reactivity 


Diaanosis 


ER/PR Status 


S00-2297 
(slide 11) 


50 


Light cytoplasmic 
staining 


Infiltrating ducal 


ER-negative 

PR-IYlftitlVA 1 4- 

Cerb2 negative 1 + 


S00-3232A 
(slide 12) 


50 


Light cytoplasmic 
staining (very little 
tumor) 


Infiltrating ducal 


ER-positive 3+ 
Cerb2-negative 1 + 


SOO-8096 
(slide 13) 


54 




Infiltrating ducal 


ER-Negative 
PR-Negative 

PorHO-nonsitK/ft 1 4> 
OtJiL«£ lloyauvo 1 1* 


S00-2097 
(slide 14) 


66 


Very little tumor 


Infiltrating ducal 


ER-positive 3+ 
PR-positive 2-3+ 

PorhO-nonatix/A 94- 
L/ciU& -noyciuvo ct 


S88-2476 
(slide 15) 


79 




Infiltrating ducal 


ER-strongly 
positive 

DR.ctrnnnK/ 
rn-buoiiyiy 

positive 


S88-2551 
(slide 16) 


81 


Very light cytoplasmic 
staining 


Infiltrating ducal 


ER-strongly 
positive \ 

PD rv"*citi\/o 

rn-puoiiivc 


S88-2665 
(slide 17) 


73 


Marginal membrane 
staining; cytoplasmic 
staining 


Infiltrating ducal 


ER-positive 
PR-negative 


S88-2476 
(slide 18) 


79 


Light membrane 
staining 


Infiltrating ducal 


ER-strongly 
positive 
PR-strongly 
positive 


S00-2491 
(slide 19) 


77 


Light cytoplasmic 
staining 

Little tumor present 


Lobular 
Infiltrating 


ER-positive 1-3+ 
PR-positive 1-3+ 
oerufc n©ga.u vo ot 


S85-2667 
(slide 20) 


68 


Cytoplasmic staining 


Infiltrating ducal 


ER-strongly 
positive 
PR-strongly 
positive 


00-6606A 
(slide 49) 


80 


Membrane staining; 
cytoplasmic staining 


Infiltrating ducal 


ER-negative 
PR-negative 
Cerb2-positive 3+ 


S88-1146 
(slide 50, in 
| boxl) 


88 


Light cytoplasmic 
staining 


Infiltrating ducal 


ER-strongly 

positive 

PR-negative 
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Ovarian Carcinomas (results outlined in Table 5) 

None of the 17 ovarian carcinomas tested with the B305D 
antibody labeled with a membrane pattern. About half of the tissues labeled 
with a cytoplasmic staining pattern. 



5 Table 5 

B3n5D STAINING OF OVARIAN CARCINOMAS 





Tissue (slide #) 


Age 


Diagnosis 


IHC 

Reactivity/Comments 


1. 


73-1808 (slide 
37) 


73 


Papillary mucinous 
adenocarcinoma 




2. 


76-1076 (slide 
38) 


50 


Serous adenocarcinoma 




3. 


81-1910 (slide 
39) 


51 


Serous adenocarcinoma 


Cytoplasmic staining; not 
uniform 


4. 


88-220 (slide 40) 


40 


Mucinous 

cystadenocarcinoma 


Light cytoplasmic 
staining 


5. 


88-2207 (slide 
41) 


75 


Papillary Serious 
cystadenocarcinoma 




6. 


88-2527 (slide 
42) 


29 


Malignant teratoma 


Light cytoplasmic 
staining; not uniform 


7. 


00-5294 (slide 
43) 


55 


Papillary 
adenocarcinoma 


Light cytoplasmic 
staining 


8. 


84-779 (slide 44) 


48 


Endometriod carcinoma 


Light cytoplasmic 
staining 


9. 


84-1843 (slide 
45) 


32 


Papillary serious 
adenocarcinoma 


Cytoplasmic staining 


10. 


85-2373 (slide 
46) 


47 


Granulosa cell tumor 


Light cytoplasmic 
staining 


11. 


86-813 (slide 47) 


74 


Clear cell carcinoma 




12. 


QMTB#26 (slide 
48) 




Five different ovarian 
carcinomas 


All negative 



normal Tissues (results out i ined in Table 6) 

Of the five stomach cases tested, all had staining above 
background in the glands below the gastric epithelium. Staining was 
cytoplasmic and grainy and was present with both detection systems. There 
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was some staining in the negative control but this staining was diffuse and not 
grainy. Background staining was common in these cells. The B305D staining 
appeared to be due to the antibody binding and not the detection system. 

Five different kidney cases were tested. The medulla region was 
5 represented in each case. There was staining in the tubules throughout the 
kidney, but this appears to be due to endogenous biotin as similar but lighter 
staining was present in the negative controls. There was much less staining in 
the ABC stained slides compared with the strept-avidin slides, which is also 
consistent with endogenous biotin. The SHIER II pretreatment required to 

1 0 obtain staining with the antibody tended to give more background staining, 
particularly due to endogenous biotin. 

Of the five different pancreas tissues tested, no specific staining 
was detected. A subset of acinar cells gave staining in both the B305D and the 
rabbit IgG control. Once again this staining was non-specific. Pancreas often 

1 5 gave non-specific staining, possibly due to the enzymatic activity of the tissue. 

A variety of other normal tissues (not shown in Table 6) were 
tested including skin, testis, colon, heart, thymus, artery, skeletal muscle, small 
bowel, pituitary, spinal cord, spleen, ureter, gall bladder, placenta, thyroid, liver, 
brain-cerebellum, bone marrow, parathyroid, lung esophagus, uterus, adrenal, 

20 lymph node, brain-cortex, fallopian tude, bladder, and prostate. Weak IHC 
staining was observed in small bowel, uterus, and bladder. However, no 
mRNA expression was seen in these tissues. Thus, this weak staining likely 
does not represent protein expression in these tissues. The gall bladder 
stained positive and will be analyzed further. Half of the prostate samples 

25 stained positive as well as the single testis sample examined. 

B305D expression was also analyzed in prostate tumor samples. 
One of 5 grade 3+3 samples stained positive while none of the grade 3+4 
samples stained positive. One additional sample of 3 unknown grade samples 
stained positive. However, an additional array of 55 primary and primary 

30 metastatic prostate tumor samples was tested and no staining was observed. 
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Table 6 

B305D STAINING OF OTHER TISSUES (NORMAL KIDNEY . STOMACH AND 

PANCREAS) 





Tissue (Slide #) 


B305D IHC 
Reactivity 


Comments 




Stomach 






1 

1 • 


Rlk 85-568 (slide 
22) 


cvtomolasmic 


Grainy cytomplasmic staining of 
glands below epithelium (not in neg 
control) 


2. 


Blk 85-587 (slide 
23) 


cytomplasmic 


Graining staining of glands below 
epithelium, some background in 
negative control 


3. 


Blk 85-1206 (slide 
24) 


cytomplasmic 


Graining staining of glands below 
epithelium, lighter background in 
negative control 


4. 


Blk 85-1225 (slide 
25) 


cytomplasmic 


Marginal staining 


5. 


Blk 85-1426 (slide 
26) 


cytomplasmic 


Grainy staining of glands below 
epithelium, some background in 
negative control 




Kidney 






1. 


Blk 00-7008 (slide 
27) 


Inconclusive 
(most likely 
negative) 


Staining of tubules; also present in 
neg control (lighter) - mostly likely 
due to endogenous biotin 


2. 


Blk 00-5638 (slide 
28) 


Same as above 


Same as above 


3. 


Blk 00-1711 (slide 
29) 


Same as above 


Same as above 


4. 


Blk 00-3859 (slide 
30) 


Same as above 


Same as above 


5. 


Blk 00-7651 (slide 
3D 


Same as above 


Same as above 
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Tissue (Slide #) 


B305D IHC 
Reactivity 


Comments 




Pancreas 






1. 


Blk Q965 (slide 32) 


Negative 


Non-specific staining in negative 
control 


2. 


Blk 00-2287 (slide 
33) 


Negative 


Non-specific staining in negative 
control 


3. 


Blk 00-2790 (slide 
34) 


Negative 


Non-specific staining in negative 
control 


4. 


Blk 00-6899 (slide 
35) 


Negative 


Non-specific staining in negative 
control 


5. 


Blk 00-7053 (slide 
36) 


Negative 


Non-specific staining in negative 
control 



In summary, B305D was only observed in less than 20% of breast 
carcinomas. Staining was observed in half of the normal prostate samples 
however, membrane staining was not detected in normal breast, in ovarian 
5 carcinomas or in normal pancreas, kidney, stomach or a panel of other normal 
tissues. 



EXAMPLE 12 

Analysis of Breast-tumor Specific B305D Sequences 
Numerous forms of the breast tumor antigen, B305D have been 
1 0 isolated. To date, isoforms A (DNA SEQ ID NO:291 , 292, 296, 313, 31 4) A 
variant (DNA SEQ ID NO:299), B (DNA SEQ ID NO:294, 297), and C (DNA 
SEQ ID NO:295, 301 , 302, 303) have been identified. Using B305D gene 
specific 5' and 3' primers representing all known forms of B305D, specific forms 
of this gene expressed in breast tumors were amplified. Disclosed herein in 
1 5 SEQ ID NO:341 -348 are 4 D305D nucleotide sequences and their 

corresponding amino acid sequences identified specifically in breast tumors as 
described below. 

Two PCR reactions were carried out using primers specific to 
B305D. The products were then analyzed and full-length sequences were 
20 compiled. For the first reaction, primers were designed to regions common to 
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all B305D forms near the 5* and 3' ends of the gene. The second set of PCR 
reactions used primers specific to each of the start sites specific to each of the 
forms. Three 5' primers were designed to amplify from the B305D A form, A 
form frameshift and C form start sites. 3' reverse primers were designed to a 
5 common region of all B305D forms, slightly upstream of the 3' primer used in 
the first PCR reaction. PCR was carried out using these primers and cDNA 
derived from breast tumor RNA numbers 443, 23B, and S76. All products were 
sequenced, analyzed and compiled. 

Two variants of the B305D A isoform were identified in the breast 
1 0 tumor samples. The nucleotide sequence of these 2 variants is set forth in 
SEQ ID NO:341 and 342 and the corresponding amino acid sequence is set 
forth in SEQ ID NO:345 and 346. One of these variants (SEQ ID NO:341) is 
identical to a previously identified variant of B305D A isoform described in 
Example 1 and set forth in SEQ ID NO:314. The other variant (SEQ ID 
1 5 NO:342) differs from SEQ ID NO:31 4 by 2 base pairs and encodes an amino 
acid sequence (SEQ ID NO:346) that differs by one amino acid from the 
previously identified A isoform set forth in SEQ ID NO:315. 

Two new variants of the B305D C isoform were also identified 
from the breast tumor samples. The nucleotide sequence of these two variants 
20 is provided in SEQ ID NO:343 and 344 and the corresponding amino acid 
sequence is set forth in SEQ ID NO:347 and 348. The 5' end of the 2 C 
isoform variants appears to be a truncated C isoform that is missing one of the 
two 4 base pair repeats normally seen in the C isoform. The 3" end of these 
variants aligns well to the A isoforms. More specifically, there is a splice 
25 junction at around base 297. It is at this junction where SEQ IDs 343 and 344 
diverge from the standard C form and the remaining 3' end being the A form. 
Upstream (5" of) of this junction the sequence of B305D isoforms set forth in 
SEQ ID NO:343 and 344 are missing 1 1 1 base pairs of standard B305D C form 
respeat sequence. The variant set forth in SEQ ID 343 is the shortest, having 
30 an additional 6 base pair deletion in the large missing repeat. Thus, in 

summary, SEQ ID NO:343 and 344 begin with the ATG of the standard B305D 
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C isoform. The sequence continues as the C isoform for about 185 base pairs 
for SEQ ID NO:344 and 1 79 base pairs for SEQ ID NO:343. Both sequences 
then have about a 1 12 base pair deletion of repeat sequence just prior to the 
splice junction. Following the splice junction, both variants follow the A form. 

5 EXAMPLE 13 

Identification of CD4 T Cell Epitopes For B305D 

This example demonstrates the identification of CD4+ T cell 
epitopes of the C form of B305D (full-length cDNA and amino acid sequence of 
B305D are set forth in SEQ ID NO:301 and 304, respectively). 
1 o CD4+ T cell responses were generated using PBMC of normal 

donors using dendritic cells (DC) pulsed with overlapping 20-mer peptides 
spanning the entire B305D C isoform protein. Briefly, CD4+ T cells were 
stimulated 3-4 times with DC pulsed with a mixture of overlapping peptides in 
IMDM media containing IL-6 and IL-12 in the primary stimulation, and IL-2 + IL- 
15 7 in all other stimulations. These lines were subsequently assayed using a 
standard proliferation assay (measuring tritiated thymidine uptake) for reactivity 
with the priming peptides or recombinant £ coli derived B305D. 

A number of different peptides elicited B305D specific T cells. 
These CD4+ T cell epitopes are contained in the following sequences: 
20 VNKKDKQKRTALHLASANGNSEWKLLLDR (SEQ ID NO:349): 

(peptides 34-46 corresponding to amino acids 166-195 of SEQ ID NO:304). 

ALHLASANGNSEWKLLLDRRCQLNVLDNK (SEQ ID NO:350) 
(peptides 36-38 corresponding to amino acids 176-205 of SEQ ID NO:304). 

GSASIVSLLLEQNIDVSSQDLSGQT (SEQ ID NO:351) (peptides 
25 64-65 corresponding to amino acids 31 6-340 of SEQ ID NO:304). 

CD4+ T cells recognizing these peptides also recognize 
recombinant B305D protein, suggesting that these are naturally processed 
epitopes. Two of these lines (lines 31 .9 and 31 .1 0 recognizing peptides set 
forth in SEQ ID NO:349 and 350) also recognized mammalian sources of 
30 B305D including baculovirus protein, lysates from HEK cells transiently 
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transfected with B305D and lysates from cells infected with adenovirus 
expressing 6305D. 

Thus, these studies demonstrate that CD4+ T cell immunity to 
B305D can be elicited and identify the peptides set forth in SEQ ID NO:349- 
5 351 as immunogenic, naturally processed CD4+ T cell epitopes. 

EXAMPLE 14 

Autoantibodies to B305D In Breast Cancer Sera and 
Epitope Mapping of the Antigenic Sites 

Autoantibodies to specific B305D peptide epitopes were identified 

1 0 in the sera of breast cancer patients. Overlapping peptides spanning the entire 
B305D sequence (cDNA and amino acid sequence of the C form of B305D set 
forth in SEQ ID NO:301 and 304, respectively) were synthesized and tested by 
ELISA with sera from patients with breast cancer to determine the presence of 
B305D-specific antibodies. Several immunoreactive regions were identified, 

1 5 including immunodominant regions encompasssing the ankyrin repeat portion 
of the molecule. 

Seventy-four 20-mer peptides overlapping by 1 5 amino acids, 
spanning the entire open reading frame of B305D were synthesized (amino 
acid sequences set forth in SEQ ID NO:352-425). These 74 peptides were 

20 tested in ELISA to evaluate which epitopes reacted with breast cancer sera as 
well as control sera. Initially peptides were pooled and tested to locate regions 
of activity. Highest activity was obtained in peptides 1-24 (SEQ ID NO:352- 
375) and these were retested individually to determine the specific epitopes. 
Peptides 3,5,6,11,13,19 and 20 (SEQ ID NO:354, 356, 357, 362, 364, 370, 

25 371 , respectively) were then further tested with a complete panel of 74 breast, 
50 ovarian and 55 prostate cancer sera as well as controls. 1 8 of 74 breast 
cancer sera were reactive with one or more peptides. Both breast and ovarian 
cancer sera showed reactivity and active epitopes appeared located in the 
ankyrin repeat regions of B305D. The amino acid sequence of the 3 ankyrin 

30 repeat sequences found in B305D are set forth in SEQ I D NO:426-428 and are 

136 



WnnvM343l fhttp://Mww.qetthapatent.com/Log in.^ 



WO 03/013431 PC17US02/24917 

present within the overiapping peptides set forth in SEQ ID NO:356-359, 363- 
366, and 368-376, respectively. 

Detection of autoantibodies to B305D in breast cancer sera 
indicates that such patients can elicit an immune response to specific epitopes 

5 and indicates that B305D can be used either alone or in combination with other 
breast tumor antigens as a target for vaccine development. Knowing that 
antibodies to B305D are present in the serum of breast cancer patients 
strengthens the potential use of this antigen as a vaccine target. In addition, 
detection of antibodies to B305D can be used as a diagnostic for breast cancer 

10 alone or in combination with detecting antibodies to other antigens, e.g. , Her- 
2/neu or other tumor antigens. The presence of antibodies to B305D also 
indicates that B305D antigen is present in serum and could be used as a target 
for development of a specific antigen detection assay. 

EXAMPLE 15 

1 5 Analysis of cDNA Expression Using Microarray Technology 

In additional studies, sequences disclosed herein are evaluated 
for overexpression in specific tumor tissues by microarray analysis. Using this 
approach, cDNA sequences are PCR amplified and their mRNA expression 
profiles in tumor and normal tissues are examined using cDNA microarray 

20 technology essentially as described (Shena, M. ef a/., 1 995 Science 270:467- 
70). In brief, the clones are arrayed onto glass slides as multiple replicas, with 
each location corresponding to a unique cDNA clone (as many as 5500 clones 
can be arrayed on a single slide, or chip). Each chip is hybridized with a pair of 
cDNA probes that are fluorescence-labeled with Cy3 and Cy5, respectively. 

25 Typically, 1 \>Q of polyA + RNA is used to generate each cDNA probe. After 
hybridization, the chips are scanned and the fluorescence intensity recorded for 
both Cy3 and Cy5 channels. There are multiple built-in quality control steps. 
First, the probe quality is monitored using a panel of ubiquitously expressed 
genes. Secondly, the control plate also can include yeast DNA fragments of 

30 which complementary RNA may be spiked into the probe synthesis for 
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measuring the quality of the probe and the sensitivity of the analysis. Currently, 
the technology offers a sensitivity of 1 in 1 00,000 copies of mRNA. Finally, the 
reproducibility of this technology can be ensured by including duplicated control 
cDNA elements at different locations. 

5 EXAMPLE 16 

Analysis of cDNA Expression Using Real-time PCR 

Real-time PCR (see Gibson et al., Genome Research 6:995- 
1 001 , 1 996; Heid et al., Genome Research 6:986-994, 1 996) is a technique 
that evaluates the level of PCR product accumulation during amplification. This 

1 0 technique permits quantitative evaluation of mRNA levels in multiple samples. 
Briefly, mRNA is extracted from tumor and normal tissue and cDNA is prepared 
using standard techniques. Real-time PCR is performed, for example, using a 
Perkin Elmer/Applied Biosystems (Foster City, CA) 7700 Prism instrument. 
Matching primers and fluorescent probes are designed for genes of interest 

1 5 using, for example, the primer express program provided by Perkin 

Elmer/Applied Biosystems (Foster City, CA). Optimal concentrations of primers 
and probes are initially determined by those of ordinary skill in the art, and 
control {e.g., p-actin) primers and probes are obtained commercially from, for 
example, Perkin Elmer/Applied Biosystems (Foster City, CA). To quantitate the 

20 amount of specific RNA in a sample, a standard curve is generated using a 
plasmid containing the gene of interest. Standard curves are generated using 
the a values determined in the real-time PCR, which are related to the initial 
cDNA concentration used in the assay. Standard dilutions ranging from 10-10 6 
copies of the gene of interest are generally sufficient. In addition, a standard 

25 curve is generated for the control sequence. This permits standardization of 
initial RNA content of a tissue sample to the amount of control for comparison 
purposes. 

An alternative real-time PCR procedure can be carried out as 
follows: The first-strand cDNA to be used in the quantitative real-time PCR is 
30 synthesized from 20 m of total RNA that is first treated with DNase I (e.g. , 
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Amplification Grade, Gibco BRL Life Technology, Gaitherburg, MD), using 
Superscript Reverse Transcriptase (RT) (e.g., Gibco BRL Life Technology, 
Gaitherburg, MD). Real-time PCR is performed, for example, with a 
GeneAmp™ 5700 sequence detection system (PE Biosystems, Foster City, 

5 CA). The 5700 system uses SYBR™ green, a fluorescent dye that only 
intercalates into double stranded DNA, and a set of gene-specific forward and 
reverse primers. The increase in fluorescence is monitored during the whole 
amplification process. The optimal concentration of primers is determined 
using a checkerboard approach and a pool of cDNAs from breast tumors is 

1 0 used in this process. The PCR reaction is performed in 25 |il volumes that 
include 2.5 |xl of SYBR green buffer, 2 iaI of cDNA template and 2.5 |xl each of 
the forward and reverse primers for the gene of interest. The cDNAs used for 
RT reactions are diluted approximately 1 :10 for each gene of interest and 1 :1 00 
for the p-actin control. In order to quantitate the amount of specific cDNA (and 

1 5 hence initial mRNA) in the sample, a standard curve is generated for each run 
using the plasmid DNA containing the gene of interest. Standard curves are 
generated using the Ct values determined in the real-time PCR which are 
related to the initial cDNA concentration used in the assay. Standard dilution 
ranging from 20-2x1 0 6 copies of the gene of interest are used for this purpose. 

20 In addition, a standard curve is generated for p-actin ranging from 200fg- 
2000fg. This enables standardization of the initial RNA content of a tissue 
sample to the amount of p-actin for comparison purposes. The mean copy 
number for each group of tissues tested is normalized to a constant amount of 
p-actin, allowing the evaluation of the over-expression levels seen with each of 

25 the genes. 

EXAMPLE 17 
Peptide Priming Of T-helper Lines 

Generation of CD4 + T helper lines and identification of peptide 
epitopes derived from tumor-specific antigens that are capable of being 
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recognized by CD4 + T cells in the context of HLA class II molecules, is carried 
out as follows: 

Frfteen-mer peptides overlapping by 10 amino acids, derived from 
a tumor-specific antigen, are generated using standard procedures. Dendritic 

5 cells (DC) are derived from PBMC of a normal donor using GM-CSF and IL-4 
by standard protocols. CD4 + T cells are generated from the same donor as the 
DC using MACS beads (Miltenyi Biotec, Auburn, CA) and negative selection. 
DC are pulsed overnight with pools of the 15-mer peptides, with each peptide 
at a final concentration of 0.25 |ig/ml. Pulsed DC are washed and plated at 1 x 

10 1 0 4 cells/well of 96-well V-bottom plates and purified CD4 + T cells are added at 
1 x 10 5 /well. Cultures are supplemented with 60 ng/ml IL-6 and 10 ng/ml IL-12 
and incubated at 37°C. Cultures are restimulated as above on a weekly basis 
using DC generated and pulsed as above as antigen presenting cells, 
supplemented with 5 ng/ml IL-7 and 10 U/ml IL-2. Following 4 in vitro 

15 stimulation cycles, resulting CD4 + T cell lines (each line corresponding to one 
well) are tested for specific proliferation and cytokine production in response to 
the stimulating pools of peptide with an irrelevant pool of peptides used as a 
control. 

EXAMPLE 18 

20 Generation of Tumor-Specific CTL Lines Using In Vitro Whole-Gene 

Priming 

Using in vitro whole-gene priming with tumor antigen-vaccinia 
infected DC (see, for example, Yee et al, The Journal of Immunology, 
157(9):4079-86, 1996), human CTL lines are derived that specifically recognize 

25 autologous fibroblasts transduced with a specific tumor antigen, as determined 
by interferon-? ELISPOT analysis. Specifically, dendritic cells (DC) are 
differentiated from monocyte cultures derived from PBMC of normal human 
donors by growing for five days in RPMI medium containing 10% human 
serum, 50 ng/ml human GM-CSF and 30 ng/ml human IL-4. Following culture, 

30 DC are infected overnight with tumor antigen-recombinant vaccinia virus at a 
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multiplicity of infection (M.O.I) of five, and matured overnight by the addition of 
3 ng/ml CD40 ligand. Virus is then inactivated by UV irradiation. CD8+T cells 
are isolated using a magnetic bead system, and priming cultures are initiated 
using standard culture techniques. Cultures are restimulated every 7-10 days 

5 using autologous primary fibroblasts retrovirally transduced with previously 
identified tumor antigens. Following four stimulation cycles, CD8+ T cell lines 
are identified that specifically produce interferon-y when stimulated with tumor 
antigen-transduced autologous fibroblasts. Using a panel of HLA-mismatched 
B-LCL lines transduced with a vector expressing a tumor antigen, and 

10 measuring interferon-/ production by the CTL lines in an ELISPOT assay, the 
HLA restriction of the CTL lines is determined. 

EXAMPLE 19 

Generation and Characterization of Anti-Tumor Antigen Monoclonal 

Antibodies 

1 5 Mouse monoclonal antibodies are raised against E. coli derived 

tumor antigen proteins as follows: Mice are immunized with Complete Freund's 
Adjuvant (CFA) containing 50 \>g recombinant tumor protein, followed by a 
subsequent intraperitoneal boost with Incomplete Freund's Adjuvant (IFA) 
containing 10 |ig recombinant protein. Three days prior to removal of the 

20 spleens, the mice are immunized intravenously with approximately 50 ng of 
soluble recombinant protein. The spleen of a mouse with a positive titer to the 
tumor antigen is removed, and a single-cell suspension made and used for 
fusion to SP2/0 myeloma cells to generate B cell hybridomas. The 
supematants from the hybrid clones are tested by ELISA for specificity to 

25 recombinant tumor protein, and epitope mapped using peptides that spanned 
the entire tumor protein sequence. The mAbs are also tested by flow cytometry 
for their ability to detect tumor protein on the surface of cells stably transfected 
with the cDNA encoding the tumor protein. 
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EXAMPLE 20 
Synthesis of Polypeptides 

Polypeptides are synthesized on a Perkin Elmer/Applied 
Biosystems Division 430A peptide synthesizer using FMOC chemistry with 

5 HPTU (O-Benzotriazole-N.N.N'.N'-tetramethyluronium hexafluorophosphate) 
activation. A Gly-Cys-Gly sequence is attached to the amino terminus of the 
peptide to provide a method of conjugation, binding to an immobilized surface, 
or labeling of the peptide. Cleavage of the peptides from the solid support is 
carried out using the following cleavage mixture: trrfluoroacetic 

1 0 acid:ethanedrthiol:thioanisole:water:phenol (40:1 :2:2:3) . After cleaving for 2 
hours, the peptides are precipitated in cold methyl-t-butyl-ether. The peptide 
pellets are then dissolved in water containing 0.1% trifiuoroacetic acid (TFA) 
and lyophilized prior to purification by C1 8 reverse phase HPLC. A gradient of 
0%-60% acetonftrile (containing 0.1% TFA) in water (containing 0.1% TFA) is 

1 5 used to elute the peptides. Following lyophil'ization of the pure fractions, the 
peptides are characterized using electrospray or other types of mass 
spectrometry and by amino acid analysis. 

EXAMPLE 21 

Generation of B305D-Specific CTL Lines and Clones 
20 Using In V/tro Whole-Gene Priming 

This example describes the generation of B305D-specific CD8+ T 
lymphocytes from a normal donor and identification of the HLA restriction of 
two CD8+ T cell clones. B305D C isoform is a breast tumor antigen that is 
preferentially expressed in breast tumors as compared to normal breast tissue. 
25 These experiments further confirm the immunogenicity of the B305D protein 
and support its use as a target for vaccine and/or other immunotherapeutic 
approaches. 

Standard in-vitro priming was established in 96-well plates 
generally as described in Example 18. More specifically, a total of 960 cultures 
30 were established, using as APC DC infected with adenovirus expressing 
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B305D C isoform (SEQ ID NO: 301) for the initial stimulation, and autologous 
fibroblasts transduced to express the 5' or 3' 1/2 of B305D C isoform for 3 
additional stimulations. T cell lines were screened by y-IFN ELISPOT assays 
on fibroblasts expressing either the 5' half (amino acids 1 - 200 of SEQ ID 
5 NO:304) or the 3' half (amino acids 1 60 - 384 of SEQ ID NO:304) of B305D C 
isoform. Six T cell lines were identified that recognized either the 5' fragment 
(3B9 ( 7E5, and 8H8) or 3' fragment (4G2, 5E6, 7E10) of B305D C isoform. 
Clones were then generated from lines 3B9, 5E6, and 8H8 and shown to 
recognize B305D by y-IFN ELISPOT assay. Antibody blocking y-IFN ELISPOT 

10 assays were performed to identify the relevant restricting alleles of each of the 
clones. The activity of 8H8 and 3B9 clones (3 1 fragment specific) was 
specifically blocked by pan class I and HLA-B/C blocking antibodies, and the 
activity of 5E6 clones was blocked by pan class I and HLA -A2 blocking 
antibodies. These results suggest that the restricting allele for the 8H8 and 

1 5 5E6 response is one of the B or C alleles of the donor, D385 (B7, B35, Cw4, 
Cw7), and the restricting allele for the 3B9 clone is the HLA-A0205 allele 
expressed by D385. These results further suggest that there are at least 2 
epitopes from B305D that are recognized by these T cell clones. 

In summary, these data demonstrate that precursor T cells 

20 specific for B305D C isoform exist that can be activated by vaccination 
strategies, and additionally indicate that naturally processed epitopes from 
B305D exist that can be used for both vaccination and immune monitoring 
strategies. 

From the foregoing it will be appreciated that, although specific 
25 embodiments of the invention have been described herein for purposes of 
illustration, various modifications may be made without deviating from the spirit 
and scope of the invention. Accordingly, the invention is not limited except as 
by the appended claims. 
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CLAIMS 

What is Claimed: 

1 . An isolated polynucleotide comprising a sequence selected 
from the group consisting of: 

(a) sequences provided in SEQ ID NO:342, 341 , and 343-344; 

(b) complements of the sequences provided in SEQ ID 
NO:342, 341 , and 343-344; 

(c) sequences consisting of at least 20 contiguous residues of 
a sequence provided in SEQ ID NO:342, 341, and 343-344; 

(d) sequences that hybridize to a sequence provided in SEQ 
ID N0.342, 341 , and 343-344, under highly stringent conditions; 

(e) sequences having at least 75% identity to a sequence of 
SEQ ID NO:342, 341 , and 343-344; 

(f) sequences having at least 90% identity to a sequence of 
SEQ ID NO:342, 341, and 343-344; and 

(g) degenerate variants of a sequence provided in SEQ ID 
NO:342, 341 , and 343-344. 

2. An isolated polypeptide comprising an amino acid 
sequence selected from the group consisting of: 

(a) sequences encoded by a polynucleotide of claim 1 ; and 

(b) sequences having at least 70% identity to-a sequence 
encoded by a polynucleotide of claim 1; and 

(c) sequences having at least 90% identity to a sequence 
encoded by a polynucleotide of claim 1; 

(d) sequences set forth in SEQ ID NO:345-428; 

(e) sequences having at least 70% identity to a sequence set 
forth in SEQ ID NO:345-428; and 
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(f) sequences having at least 90% identity to a sequence set 
forth in SEQ ID NO:345-428.. 

3. An expression vector comprising a polynucleotide of claim 
1 operably linked to an expression control sequence. 

4. A host cell transformed or transfected with an expression 
vector according to claim 3. 

5. An isolated antibody, or antigen-binding fragment thereof, 
that specifically binds to a polypeptide of claim 2. 

6. A method for detecting the presence of a cancer in a 
patient, comprising the steps of: 

(a) obtaining a biological sample from the patient; 

(b) contacting the biological sample with a binding agent that 
binds to a polypeptide of claim 2; 

(c) detecting in the sample an amount of polypeptide that 

binds to the binding agent; and 

(d) comparing the amount of polypeptide to a predetermined 
cutoff value and therefrom determining the presence of a cancer in the patient. 

7. A fusion protein comprising at least one polypeptide 
according to claim 2. 

8. An oligonucleotide that hybridizes to a sequence recited in 
SEQ ID NO:342, 341 , and 343-344 under highly stringent conditions. 
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9. A method for stimulating and/or expanding T cells specific 
for a tumor protein, comprising contacting T cells with at least one component 
selected from the group consisting of: 

(a) polypeptides according to claim 2; 

(b) polynucleotides according to claim 1 ; and 

(c) antigen-presenting cells that express a polynucleotide 

according to claim 1 , 

under conditions and for a time sufficient to permit the stimulation 

and/or expansion of T cells. 

1 0. An isolated T cell population, comprising T cells prepared 
according to the method of claim 9. 

11. A composition comprising a first component selected from 
the group consisting of physiologically acceptable carriers and 
immunostimulants, and a second component selected from the group 
consisting of: 

(a) polypeptides according to claim 2; 

(b) polynucleotides according to claim 1 ; 

(c) antibodies according to claim 5; 

(d) fusion proteins according to claim 7; 

(e) T cell populations according to claim 1 0; and 

(f) antigen presenting cells that express a polypeptide 
according to claim 2. 

1 2. A method for stimulating an immune response in a patient, 
comprising administering to the patient a composition of claim 1 1 . 

1 3. A method for the treatment of a breast cancer in a patient, 
comprising administering to the patient a composition of claim 1 1 . 
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1 4. A method for determining the presence of a cancer in a 
patient, comprising the steps of: 

(a) obtaining a biological sample from the patient; 

(b) contacting the biological sample with an oligonucleotide 
according to claim 8; 

(c) detecting in the sample an amount of a polynucleotide that 
hybridizes to the oligonucleotide; and 

(d) comparing the amount of polynucleotide that hybridizes to 
the oligonucleotide to a predetermined cut-off value, and therefrom determining 
the presence of the cancer in the patient. 

1 5. A diagnostic kit comprising at least one oligonucleotide 
according to claim 8. 

1 6. A diagnostic kit comprising at least one antibody according 
to daim 5 and a detection reagent, wherein the detection reagent comprises a 
reporter group. 

1 7. A method for the treatment of breast cancer in a patient, 
comprising the steps of: 

(a) incubating CD4+ and/or CD8+ T cells isolated from a 
patient with at least one component selected from the group consisting of: (i) 
polypeptides according to claim 2; (ii) polynucleotides according to claim 1 ; and 
(Hi) antigen presenting cells that express a polypeptide of claim 2, such that T 
cell proliferate; 

(b) administering to the patient an effective amount of the 
proliferated T cells, 

and thereby inhibiting the development of a cancer in the patient. 
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NUCLEOTIDE SEQUENCE OF THE REPRESENTATIVE 
BREAST-TUMOR SPECIFIC cDNA B18Ag1 

TTA GAG ACC CAA TTG GGA CCT AAT TGG GAC CCA AAT HC TCA AGT GGA 48 
Leu Glu Thr Gin Leu Gly Pro Asn Tip Asp Pro Asn Phe Ser Ser Gly 
1 5 10 15 

GGG AGA ACT TTT GAC GAT TTC CAC CGG TAT CTC CTC GTG GGT ATT CAG 96 
Gly Arg Thr Phe Asp Asp Phe His Arg Tyr Leu Leu Val Gly He Gin 
20 25 30 

GGA GCT GCC CAG AAA CCT ATA AAC TTG TCT AAG GCG ATT GAA GTC GTC 144 
Gly Ala Ala Gin Lys Pro He Asn Leu Ser Lys Ala He Glu Val Val 
35 40 45 



CAG GGG CAT GAT GAG TCA CCA GGA GTG TTT TTA GAG CAC CTC CAG GAG 192 
Gin Gly His Asp Glu Ser Pro Gly Val Phe Leu Glu His Leu Gin Glu 
50 55 60 



GCT TAT CGG ATT TAC ACC CCT TTT GAC CTG GCA GCC CCC GAA AAT AGC 240 
Ala Tyr Arg He Tyr Thr Pro Phe Asp Leu Ala Ala Pro Glu Asn Ser 
65 70 75 80 

CAT GCT CTT AAT TTG GCA TTT GTG GCT CAG GCA GCC CCA GAT AGT AAA 288 
His Ala Leu Asn Leu Ala Phe Val Ala Gin Ala Ala Pro Asp Ser Lys 

85 90 95 

AGG AAA CTC CAA AAA CTA GAG GGA TTT TGC TGG AAT GAA TAC CAG TCA 336 
Arg Lys Leu Gin Lys Leu Glu Gly Phe Cys Trp Asn Glu Tyr Gin Ser 
100 105 110 

GCT TTT AGA GAT AGC CTA AAA GGT TTT 363 
Ala Phe Arg Asp Ser Leu Lys Gly Phe 

115 120 p ig g 

NUCLEOTIDE SEQUENCE OF THE REPRESENTATIVE 
BREAST-TUMOR SPECIFIC cDNA B17Ag1 

GC TGGGCACAGT GGCTCATACC TGTAATCCTG ACCGTTTCAG AGGCTCAGGT 60 

CG CTTGAGCCCA AGATTTCAAG ACTAGTCTGG GTAACATAGT GAGACCCTAT 120 

AA AAATAAAAAA ATGAGCCTGG TGTAGTGGCA CACACCAGCT GAGGAGGGAG 180 

CT AGGAGA 1% 



Fig. 7 
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NUCLEOTIDE SEQUENCE OF THE REPRESENTATIVE 
BREAST-TUMOR SPECIFIC cDNA B17Ag2 

GC TTGGGGGCTC TGACTAGAAA TTCAAGGAAC CTGGGATTCA AGTCCAACTG 60 
AC TTACACTGTG GNCTCCAATA AACTGCTTCT TTCCTATTCC QCTCTATTA 120 
AA GGAAAACGAT GTCTGTGTAT AGCCAAGTCA GNTATCCTAA AAGGAGATAC 180 
AT TAAATATCAG AATGTAAAAC CTGGGAACCA GGTTCCCAGC CTGGGATTAA 240 
CA AGAAGACTGA ACAGTACTAC TGTGAAAAGC CCGAAGNGGC AATATGTTCA 300 
TT GAAGGATGGC TGGGAGAATG AATGCTCTGT CCCCCAGTCC CAAGCTCACT 360 
CT CCTTTATAGC CTAGGAGA 388 

Fig. 8 



NUCLEOTIDE SEQUENCE OF THE REPRESENTATIVE 
BREAST-TUMOR SPECIFIC cDNA B13Ag2a 

GC CTATAATCAT GTTTCTCATT ATTTTCACAT TTTATTAACC AATTTCTGTT 60 
AA AATATGAGGG AAATATATGA AACAGGGAGG CAATGTTCAG ATAATTGATC 120 
TG ATTTCTACAT CAGATGCTCT TTCCTTTCCT GTTTATTTCC TTTTTATTTC .180 
GG TCGAATGTAA TAGCTTTGTT TCAAGAGAGA GTTTTGGCAG TTTCTGTAGC 240 
CT GCTCATGTCT CCAGGCATCT ATTTGCACTT TAGGAGGTGT CGTGGGAGAC 300 
CT ATTTTTTCCA TATTTGGGCA ACTACTA 337 

Fig. 9 
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NUCLEOTIDE SEQUENCE OF THE REPRESENTATIVE 
BREAST-TUMOR SPECIFIC cDNA B13Ag1b 

6C CATACAGT6C CTTTCCATTT ATTTAACCCC CACCTGAACG 6CATAAACT6 60 

GC TGGTGTTTTT TACTGTAAAC AATAAGGAGA CTTTGCTCTT CATTTAAACC 120 

AT TTCATATTTT ACGCTCGAGG GTTTTTACCG GTTCCTTTTT ACACTCCTTA 180 

TT TAAGTCGTTT GGAACAAGAT ATTTTTTCTT TCCTGGCAGC TTTTAACATT 240 

TT TGTGTCTGGG GGACTGCTGG TCACTGTTTC TCACAGTTGC AAATCAAGGC 300 
CC AAGAAAAAAA AATTTTTTTG TTTTATTTGA AACTGGACCG GATAAACGGT 360 

CG GCTGCTGTAT ATAGTTTTAA ATGGTTTATT GCACCTCCTT AAGTTGCACT 420 

GG GGGGNTTTTG NATAGAAAGT NTTTANTCAC ANAGTCACAG GGACTTTTNT 480 

NA CTGAGCTAAA AAGGGCTGNT TTTCGGGTGG GGGCAGATGA AGGCTCACAG 540 
TC TCTTAGAGGG GGGAACTNCT A 571 

Fig. 10 
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NUCLEOTIDE SEQUENCE OF THE REPRESENTATIVE 
BREAST-TUMOR SPECIFIC cDNA B13Ag1a 

TA ATAACTTAAA TATATTTT6A TCACCCACTG 6GGTGATAAG ACAATAGATA 60 
TT TCCAAAAAGC ATAAAACCAA AGTATCATAC CAAACCAAAT TCATACTGCT 120 
CC GCACTGAAAC TTCACCTTCT AACT6TCTAC CTAACCAAAT TCTACCCTTC 180 
GG TGCGTGCTCA CTACTCTTTT TTTTTTTTTT TTTNTTTTGG AGATGGAGTC 240 
CA GCCCAGGGGT GGAGTACAAT GGCACAACCT CAGCTCACTG NAACCTCCGC 300 
TT CATGAGATTC TCCTGNTTCA GCCTTCCCAG TAGCTGGGAC TACAGGTGTG 360 
TG CCT6GNTAAT CTTTTTTNGT TTTNGGGTAG AGATGGGGGT TTTACATGTT 420 
TG GTNTCGAACT CCTGACCTCA AGTGATCCAC CCACCTCAGG CTCCCAAAGT 480 
TA CAGACATGAG CCACTGNGCC CAGNCCTGGT GCATGCTCAC TTCTCTAGGC 540 

Fig. 11 
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NUCLEOTIDE SEQUENCE OF THE REPRESENTATIVE 
BREAST-TUMOR SPECIFIC cDNA B11Ag1 

T6 CACATGCA6A ATATTCTATC 66TACTTCAG CTATTACTCA TTTT6AT6GC 60 
AG CCTATCCTCA AGATGAGTAT TTAGAAAGAA TTGATTTAGC GATAGACCAA 120 
GC ACTCTGACTA CACGAAATTG TTCAGATGTG ATGGATTTAT GACAGTTGAT 180 
GA GATTATTAAG TGATTATTTT AAAGGGAATC CATTAATTCC AGAATATCTT 240 
TC AAGATGATAT AGAAATAGAA CAGAAAGAGA CTACAAATGA AGATGTATCA 300 
TA TTGAAGAGCC TATAGTAGAA AATGAATTAG CTGCATTTAT TAGCCTTACA 360 
TT TTCCTGATGA ATCTTATATT CAGCCATCGA CATAGCATTA CCTGATGGGC 420 
GA ATAATAGAAA CTGGGTGCGG GGCTATTGAT GAATTCATCC NCAGTAAATT 480 
AC AAAATATAAC TCGATTGCAT TTGGATGATG GAATACTAAA TCTGGCAAAA 540 
GG AGCTACTAGT AACCTCTCTT TTTGAGATGC AAAATTTTCT TTTAGGGTTT 600 
CT ACTTTAC6GA TATT6GAGCA TAACGGGA 638 

Fig. 12 
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NUCLEOTIDE SEQUENCE OF THE REPRESENTATIVE 
BREAST-TUMOR SPECIFIC cDNA B3CA3c 

ACTGATGGAT GTCGCCGGAG GCGAGGGGCC TTATCTGATG CTCGGCTGCC TGTTCGTGAT 60 

GTGCGCGGCG ATTGGGCTGT TTATCTCAAA CACCGCCACG GCGGTGCTGA TGGCGCCTAT 120 

TGCCTTAGCG GCGGCGAAGT CAATGGGCGT CTCACCCTAT CCTTTTGCCA TGGTGGTGGC 180 

GATGGCGGCT TCGGCGGCGT TTATGACCCC GGTCTCCTCG CCGGTTAACA CCCTGGTGCT 240 

TGGCCCTGGC AAGTACTCAT TTAGCGATTT TGTCAAAATA GGCGTG 286 

Fig. 13 



NUCLEOTIDE SEQUENCE OF THE REPRESENTATIVE 
BREAST-TUMOR SPECIFIC cDNA B9CG1 

AG CAGCCCCTTC TTCTCAATTT CATCTGTCAC TACCCTGGTG TAGTATCTCA 60 
CA TTTTTATAGC CTCCTCCCTG GTCTGTCTTT TGATTTTCCT 6CCTGTAATC 120 
AC ATAACTGCAA GTAAACATTT CTAAA6TGTG GTTATGCTCA TGTCACTCCT 180 
AA ATAGTTTCCA TTACCGTCTT AATAAAATTC GGATTTGTTC TTTNCTATTN 240 
CA CCTATGACCG AA 262 

Fig. 14 
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NUCLEOTIDE SEQUENCE OF THE REPRESENTATIVE 
BREAST-TUMOR SPECIFIC cDNA B9CG3 

A6 CAAAGCCAGT 6GTTT6AGCT CTCTACTGTG TAAACTCCTA AACCAAGGCC 60 

TA AATGGTGGCA GGATTTTTAT TATAAACATG TACCCATGCA AATTTCCTAT 120 
GA TATATTCTTC TACATTTAAA CAATAAAAAT AATCTATTTT TAAAAGCCTA 180 
AG TTAGGTAAGA GTGTTTAATG AGAGGGTATA AGGTATAAAT CACCAGTCAA 240 
TG CCTATGACCG A 261 

Fig. 15 



NUCLEOTIDE SEQUENCE OF THE REPRESENTATIVE 
BREAST-TUMOR SPECIFIC cDNA B2CA2 

GG GCATGGACGC AGACGCCTGA CGTTTGGCTG AAAATCTTTC ATTGATTCGT 60 

AT AGGAAAATTC CCAAAGAGGG AATGTCCTGT TGCTCGCCAG TTTTTNTGTT 120 

GG ANAAGGCAAN GAGCTCTTCA GACTATTGGN ATTNTCGTTC GGTCTTCTGC 180 

CG NCTTGCNANG ATCTTCAT 208 

Fig. 16 
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NUCLEOTIDE SEQUENCE OF THE REPRESENTATIVE 
BREAST-TUMOR SPECIFIC cDNA B3CA1 

66 6CAT66AC6C A6AC6CCT6A C6TTT66CT6 AAAATCTTTC ATT6ATTC6T 60 

AT AG6AAAATTC CCAAA6AG6G AATGTCCT6T TGCTCGCCA6 TTTTTNT6TT 120 

66 ANAAGGCAAN GAGCTCTTCA GACTATT66N ATTNTC6TTC GGTCTTCT6C 180 

C6 NCTT6CNANG ATCTTCAT 208 

Fig. 17 

NUCLEOTIDE SEQUENCE OF THE REPRESENTATIVE 
BREAST-TUMOR SPECIFIC cDNA B3CA2 

6G GCATGGACGC AGACGCCTGA CGTTTGGCTG AAAATCTTTC ATTGATTCGT 60 

AT AGGAAAATTC CCAAAGA66G AATGTCCTGT T6CTCGCCAG TTTTTNT6TT 120 

GG ANAAGGCAAN GAGCTCTTCA 6ACTATTGGN ATTNTCGTTC GGTCTTCTGC 180 
CGNCTTGCNANG ATCTTCAT 208 

Fig. 18 

NUCLEOTIDE SEQUENCE OF THE REPRESENTATIVE 
BREAST-TUMOR SPECIFIC cDNA B3CA3 

AG 66A6CAAGGA GAAGGCAT66 A6A6GCTCAN 6CT6GTCCTG GCCTAC6ACT 60 

CT GTCGCCGG6G AT66T6GAGA ACTGAAGCGG GACCTCCTCG AGGTCCTCCG 120 

TC NCCGTCCA66 AG6AG6GTCT TTCCGTGGTC TNG6A66AGC G6666GA6AA 180 
TC ATGGTCNACA TCCC 204 

Fig. 19 
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NUCLEOTIDE SEQUENCE OF THE REPRESENTATIVE 
BREAST-TUMOR SPECIFIC cDNA B4CA1 

TC A6GA6C666T A6A6T66CAC CATT6A666G ATATTCAAAA ATATTATTTT 60 

T6 ATA6TT6CTG AGTTTTTCTT TGACCCATGA GTTATATTGG AGTTTATTTT 120 

CC AATCGCATGG ACATGTTAGA CTTATTTTCT GTTAATGATT NCTATTTTTA 180 

GA TTTGAGAAAT TGGTTNTTAT TATATCAATT TTTGGTATTT GTTGAGTTTG 240 
GC TTAGTATGTG ACCA 264 

Fig. 20 
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Recogniiion of Peptide by D142 anti-B11-8 
CTL line 

60-. , 




E:T ratio (x:1) 



Fig. 22 
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Recognition of B11 Transductant by B11-8 
Specific Clone A1 




E:T ratio (x:1) 



Fig. 23 
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SEQUENCE LISTING 



<110> Corixa Corporation 
F anger, Gary R. 
Hirst, Shannon Kathleen 
Dillon, Davin C- 
Foy, Teresa M. 
Houghton, Raymond L. 
Fersing, David H. 
Kalos, Michael D. 



<120> COMPOSITIONS AND METHODS FOR THE THERAPY 
AND DIAGNOSIS OF BREAST CANCER 



<130> 210121. 41931PC 

<140> US 

<141> 2002-08-05 

<160> 428 

<170> FastSEQ for Windows Version 4.0 

<210> 1 . 

<211> 363 

<212> DNA 

<213> Homo sapiens 

<400> 1 

ttagagaccc aattgggacc taattgggac ccaaatttct caagtggagg gagaactttt 60 

gacgatttcc accggtatct cctcgtgggt attcagggag ctgcccagaa acctataaac 120 

ttgtctaagg cgattgaagt cgtccagggg catgatgagt caccaggagt gtttttagag 180 

cacctccagg aggcttatcg gatttacacc ccttttgacc tggcagcccc cgaaaatagc 240 

catgctctta atttggcatt tgtggctcag gcagccccag atagtaaaag gaaactccaa 300 

aaactagagg gattttgctg gaatgaatac cagtcagctt ttagagatag cctaaaaggt 360 

ttt 363 

<210> 2 

<211> 121 

<212> PRT 

<213> Homo sapiens 

<400> 2 

Leu Glu Thr Gin Leu Gly 'Pro Asn Trp Asp Pro Asn Phe Ser Ser Gly 

-1 5 10 15 

Gly Arg Thr Phe Asp Asp Phe His Arg Tyr Leu Leu Val Gly He Gin 

20 25 30 

Gly Ala Ala Gin Lys Pro He Asn Leu Ser Lys Ala He Glu Val Val 

35 40 45 

Gin Gly His Asp Glu Ser Pro Gly Val Phe Leu Glu His Leu Gin Glu 

50 55 60 

Ala Tyr Arg He Tyr Thr Pro Phe Asp Leu Ala Ala Pro Glu Asn Ser 
65 70 75 80 

His Ala Leu Asn Leu Ala Phe Val Ala Gin Ala Ala Pro Asp Ser Lys 
85 90 95 



WO03013431 [http://www.getthepaten^^ 



WO 03/013431 PCT/US02/24917 

2 



Arg Lys Leu Gin Lys Leu Glu Gly Phe Cys Trp Asn Glu Tyr Gin Ser 

100 105 HO 

Ala Phe Arg Asp Ser Leu Lys Gly Phe 
115 120 



<210> 3 

<211> 1080 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 681, 685, 706, 720, 741, 752, 758, 780, 789, 824, 840, 859, 
866, 884, 890, 905, 917, 926, 930, 951, 957, 959, 962, 974, 
980, 982, 988, 995, 996, 1007, 1010, 1025, 1040, 1051, 1052, 
1056, 1057, 1078 
<223> n « A,T,C or G 

<400> 3 

tcttagaatc ttcatacccc gaactcttgg gaaaacttta atcagtcacc tacagtctac 60 
cacccattta ggaggagcaa agctacctca gctcctccgg agccgtttta agatccccca 120 
tcttcaaagc ctaacagatc aagcagctct ccggtgcaca acctgcgccc aggtaaatgc 180 
caaaaaaggt cctaaaccca gcccaggcca ccgtctccaa gaaaactcac caggagaaaa 240 
gtgggaaatt gactttacag aagtaaaacc acaccgggct gggtacaaat accttctagt 300 
actggtagac accttctctg gatggactga agcatttgct accaaaaacg aaactgtcaa 360 
tatggtagtt aagtttttac tcaatgaaat catccctcga cgtgggctgc ctgttgccat 420 
agggtctgat aatggaacgg ccttcgcctt gtctatagtt taatcagtca gtaaggcgtt 480 
aaacattcaa tggaagctcc attgtgccta tcgacccaga gctctgggca agtagaacgc 540 
atgaactgca ccctaaaaaa acactcttac aaaattaatc ttaaaaaccg gtgttaattg 600 
tgttagtctc cttcccttag ccctacttag agttaaggtg caccccttac tgggctgggt 660 
tctttacctt ttgaaatcat ntttnggaag gggctgccta tctttnctta actaaaaaan 720 
gcccatttgg caaaaatttc ncaactaatt tntacgtncc tacgtctccc caacaggtan 780 
aaaaatctnc tgcccttttc aaggaaccat cccatccatt cctnaacaaa aggcctgccn 840 
ttcttccccc agttaactnt tttttnttaa aattcccaaa aaangaaccn cctgctggaa 900 
aaacnccccc ctccaanccc cggccnaagn ggaaggttcc cttgaatccc ncccccncna 960 
anggcccgga accnttaaan tngttccngg gggtnnggcc taaaagnccn atttggtaaa 1020 
cctanaaatt ttttcttttn taaaaaccac nntttnnttt ttcttaaaca aaaccctntt 1080 



<210> 4 

<211> 1087 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 559, 574, 576, 581, 582, 587, 589, 593, 594, 609, 627, 640, 
659, 668, 672, 677, 691, 713, 714, 732, 741, 812, 813, 823, 
825, 829, 838, 845, 849, 852, 855, 856, 859, 874, 876, 877, 
892, 902, 907, 916, 917, 938, 950, 951, 952, 953, 960 
<223> n = A,T,C or G 

<221> misc — feature 

<222> 965, 974, 976, 978, 982, 996, 1005, 1012, 1049, 1058, 1073, 
1074, 1082, 1084, 1086 
<223> n = A,T,C or G 
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<400> 4 

tctagagctg cgcctggatc 
caagtaggcc ctttaaacta 
tttccatcat tttaaggggt 
atctgtagac ctcaggctcc 
gccaggtttc agctgcagat 
aggttaaaat cctataggct 
gattttggat ttttaacttt 
gctttaaagt actgttagtg 
agttacccta attccaaatg 
gttttatatt ttcccatcna 
aaaacggant cttgctccgt 
cttgcaanaa tnctgcntcc 
acctggaatt anaggccccc 
acgggtttcc tgttttagtt 
cctcnaatnt tnggnntang 
tntggantct tgaatnncgg 
ttttncctcc cccntntnta 
tttttctccc cccccctctt 
cnantnt 

<210> 5 
<211> 1010 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 
<222> 311, 315, 318, 339, 341, 347, 361, 379, 391, 415, 417, 419, 
424, 430, 433, 454, 463, 465, 467, 476, 497, 499, 550, 562, 
564, 587, 591, 595, 597, 598, 612, 625, 631, 640, 641, 645, 
648, 656, 661, 665, 666, 670, 674, 675, 681, 682, 683 
<223> n = A,T,C or G 

<221> misc_f eature 

<222> 687, 688, 692, 710, 721, 778, 788, 811, 820, 830, 860, 867, 
868, 871, 872, 889, 892, 896, 897, 899, 904, 915, 936, 951, 
960, 970, 986, 990, 1000 
<223> n = A,T,C or G 

<400> 5 

tctagaccaa gaaatgggag gattttagag tgactgatga tttctctatc atctgcagtt 60 
agtaaacatt ctccacagtt tatgcaaaaa gtaacaaaac cactgcagat gacaaacact 120 
aggtaacaca catactatct cccaaatacc tacccacaag ctcaacaatt ttaaactgtt 180 
aggatcactg gctctaatca ccatgacatg aggtcaccac caaaccatca agcgctaaac 240 
agacagaatg tttccactcc tgatccactg tgtgggaaga agcaccgaac ttacccactg 300 
gggggcctgc ntcanaanaa aagcccatgc ccccgggtnt ncctttnaac cggaacgaat 360 
naacccacca tccccacanc tcctctgttc ntgggccctg catcttgtgg cctcntntnc 420 
tttnggggan acntggggaa ggtaccccat ttcnttgacc ccncnanaaa accccngtgg 480 
ccctttgccc tgattcncnt gggccttttc tcttttccct tttgggttgt ttaaattccc 540 
aatgtccccn gaaccctctc cntnctgccc aaaacctacc taaattnctc nctangnntt 600 
ttcttggtgt tncttttcaa aggtnacctt ncctgttcan ncccnacnaa aatttnttcc 660 
ntatnntggn cccnnaaaaa nnnatcnncc cnaattgccc gaattggttn ggtttttcct 720 
nctgggggaa accctttaaa tttccccctt ggccggcccc ccttttttcc cccctttnga 780 
aggcaggngg ttcttcccga acttccaatt ncaacagccn tgcccattgn tgaaaccctt 840 
ttcctaaaat taaaaaatan ccggttnngg nnggcctctt tcccctccng gngggnngng 900 
aaantcctta ccccnaaaaa ggttgcttag cccccngtcc ccactccccc nggaaaaatn 960 
aaccttttcn aaaaaaggaa tataantttn ccactccttn gttctcttcc 1010 



ccgccacagt gaggagacct 
ctcacctgtg ttgtcttcta 
taaaatcatc ttgttcagac 
aaccataccc caagagttgt 
atccctggaa ggaatattcc 
tcttctgttt tgaggaagag 
aatgcttgtg aaacgctata 
agaaattaaa attccttcag 
ttttggtggt tagaatcttc 
gataaattct ctcncncctt 
tgtccangct gggaattttn 
caaaattacc ncctttttcc 
nccccccccc cggctaattt 
aggatggccc anntctgacc 
gcttaccccc cccngnngtt 
gttttccctt ttaaaccnat 
angggggttt cccaanccgg 
ttttctttnc cccaaaantc 



gaagaccaga gaaaacacag 60 
atttattctg ttttattttg 120 
ctcagcatat aaaatgaccc 180 
ctggttttgt ttaaattact 240 
agattccctg agtagtttcc 300 
ttcctgtcag agaaaaacat 360 
aaaaaaattt tctaccccta 420 
gaggattaaa ctgccatttc 480 
tttaatgttc ttgaagaagt 540 
nnttttntnt ctnntttttt 600 
ttttggccaa tctccgctnc 660 
cacctccacc ccnnggaatt 720 
gtttttgttt ttagtaaaaa 780 
ccntnatcnt ccccctcngc 840 
tttcctccat tnaaattttc 900 
tttttttttn nnncccccan 960 
gtccnccccc angtccccaa 1020 
ctatcttttc ctnnaaatat 1080 

1087 
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<210> 6 

<211> 950 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 199, 200, 209, 223, 224, 236, 240, 241, 244, 248, 249, 262, 
263, 267, 268, 269, 270, 271, 272, 273, 280, 281, 283, 285, 
286, 287, 288, 289, 290, 291, 293, 295, 296, 300, 302, 303, 
309, 313, 314, 315, 316, 317, 318, 319, 320, 322, 323 
<223> n = A,T,C or G 

<221> miscfeature 

<222> 326, 327, 331, 332, 339, 342, 343, 344, 346, 349, 352, 353, 
355, 356, 359, 360, 362, 363, 364, 367, 369, 371, 375, 377, 
378, 379, 383, 385, 387, 389, 390, 392, 396, 397, 399, 400, 
401, 402, 405, 406, 408, 409, 410, 412, 413, 414, 415 
<223> n - A,T,C or G 

<221> misc_feature 

<222> 417, 419, 420, 423, 424, 428, 431, 433, 434, 435, 437, 438, 
439, 443, 447, 449, 450, 455, 456, 458, 459, 462, 465, 467, 
469, 472, 480, 481, 483, 484, 485, 486, 487, 488, 493, 494, 
495, 496, 497, 502, 505, 507, 508, 510, 512, 517, 518 
<223> n = A,T,C or G 

<221> misc_feature 

<222> 520, 521, 524, 526, 531, 536, 538, 539, 543, 544, 548, 549, 
550, 552, 553, 555, 556, 557, 561, 563, 566, 570, 571, 572, 
576, 577, 579, 580, 582, 583, 585, 588, 590, 591, 592, 594, 
597, 603, 606, 607, 614, 616, 618, 620, 621, 622, 623 
<223> n = A,T,C or G 

<221> misc_feature 

<222> 625, 628, 629, 630, 632, 634, 637, 638, 641, 645, 651, 652, 
653, 658, 659, 663, 664, 668, 672, 673, 674, 678, 685, 689, 
696, 700, 701, 702, 704, 705, 706, 708, 710, 711, 712, 713, 
715, 719, 722, 725, 727, 731, 734, 735, 737, 739, 742 
<223> n » A,T,C or G 

<221> misc_feature 

<222> 745, 748, 749, 751, 752, 754, 755, 757, 759, 762, 765, 767, 
769, 773, 774, 775, 778, 780, 783, 785, 787, 790, 793, 797, 
800, 803, 810, 812, 824, 828, 832, 836, 839, 843, 844, 846, 
848, 850, 852, 853, 855, 858, 859, 861, 864, 865, 866 
<223> n = A,T,C or G 

<221> misc_feature 

<222> 868, 869, 872, 875, 880, 886, 889, 890, 891, 892, 893, 895, 
896, 901, 902, 906, 908, 913, 914, 916, 918, 921, 924, 925, 
930, 932, 935, 940 
<223> n = A,T,C or G 

<400> 6 

tctagagctc gcggccgcga gctctaatac gactcactat agggcgtcga ctcgatctca 60 
gctcactgca atctctgccc ccggggtcat gcgattctcc tgcctcagcc ttccaagtag 120 
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ctgggattac aggcgtgcaa caccacaccc ggctaatttt gtatttttaa tagagatggg 180 

gttttccctt gttggccann atggtctcna acccctgacc tcnngtgatc cccccncccn 240 
nganctcnna ctgctgggga tnnccgnnnn nnncctcccn ncncnnnnnn ncncnntccn 300 
tnntccttnc tcnnnnnnnn cnntcnntcc nncttctcnc cnnntnttnt cnncnnccnn 360 

cnnnccncnt ncccncnnnt tcncntncnn tntccnncnn nntcnncnnn cnnnncntnn 420 

ccnntacntc ntnnncnnnt ccntctntnn cctcnncnnt cnctncncnt tntctcctcn 480 

ntnnnnnnct ccnnnnntct cntcncnncn tncctcnntn nccncncccc ncctcncnnc 540 

ctnntttnnn cnncnnntcc ntnccnttcn nntccnntnn cnncntcncn nncnttnttc 600 

ccnccnnttc cttncncntn nnntntcnnn cncntcnntc ntttnctcct nnntcccnnc 660 

tcnnttcncc cnnntccncc ccccncctnt ctctcncccn nntnnntntn nnncntccnc 720 

tntcncnttc ntcnntncnt tnctntcnnc nncnntncnc tnccntntnt ctnnntcncn 780 

tcncntntcn ccntccnttn ctntctcctn tntccttccc ctcncctnct cnttcnccnc 840 

ccnntntntn tnncnccnnt nctnnncnnc cntcntttcn tctctnctnn nnntnncctc 900 

nncccntncc ctnntncnct nctnntaccn tnctnctccn tcttccttcc 950 

<210> 7 

<211> 1086 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<222> 501, 691, 711, 735, 751, 780, 810, 819, 826, 832, 849, 889, 
890, 904, 913, 920, 926, 937, 940, 953, 957, 960, 985, 993, 
994, 1000, 1012, 1044, 1060, 1063, 1080, 1081 
<223> n = A,T,C or G 

<400> 7 

tctagagctc gcggccgcga gctcaattaa ccctcactaa agggagtcga ctcgatcaga 60 
ctgttactgt gtctatgtag aaagaagtag acataagaga ttccattttg ttctgtacta 120 
agaaaaattc ttctgccttg agatgctgtt aatctgtaac cctagcccca accctgtgct 180 
cacagagaca tgtgctgtgt tgactcaagg ttcaatggat ttagggctat gctttgttaa 240 
aaaagtgctt gaagataata tgcttgttaa aagtcatcac cattctctaa tctcaagtac 300 
ccagggacac aatacactgc ggaaggccgc agggacctct gtctaggaaa gccaggtatt 360 
gtccaagatt tctccccatg tgatagcctg agatatggcc tcatgggaag ggtaagacct 420 
gactgtcccc cagcccgaca tcccccagcc cgacatcccc cagcccgaca cccgaaaagg 480 
gtctgtgctg aggaagatta ntaaaagagg aaggctcttt gcattgaagt aagaagaagg 540 
ctctgtctcc tgctcgtccc tgggcaataa aatgtcttgg tgttaaaccc gaatgtatgt 600 
tctacttact gagaatagga gaaaacatcc ttagggctgg aggtgagaca ccctggcggc 660 
atactgctct ttaatgcacg agatgtttgt ntaattgcca tccagggcca ncccctttcc 720 
ttaacttttt atganacaaa aactttgttc ncttttcctg cgaacctctc cccctattan 780 
cctattggcc tgcccatccc ctccccaaan ggtgaaaana tgttcntaaa tncgagggaa 840 
tccaaaacnt tttcccgttg gtcccctttc caaccccgtc cctgggccnn tttcctcccc 900 
aacntgtccc ggntccttcn ttcccncccc cttcccngan aaaaaacccc gtntganggn 960 
gccccctcaa attataacct ttccnaaaca aannggttcn aaggtggttt gnttccggtg 1020 
cggctggcct tgaggtcccc cctncacccc aatttggaan ccngtttttt ttattgcccn 1080 
ntcccc 1086 

<210> 8 

<211> 1177 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<222> 1, 4, 20, 21, 31, 278, 314, 332, 359, 371, 373, 375, 376, 
524, 537, 556, 557, 579, 583, 590, 591, 598, 623, 625, 648, 
700, 703, 719, 738, 742, 746, 749, 751, 752, 800, 808, 820, 
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821, 824, 835, 838, 845, 851, 856, 864, 865, 879, 888 
<223> n = A,T,C or G 

<221> misc_feature 

<222> 911, 920, 926, 935, 945, 950, 952, 956, 969, 972, 977, 981, 
992, 999, 1023, 1024, 1032, 1038, 1039, 1040, 1062, 1069, 
1075, 1084, 1089, 1104, 1119, 1123, 1131, 1143, 1146, 1152, 
1165, 1169, 1172, 1176 
<223> n = A,T,C or G 

<400> 8 

nccntttaga tgttgacaan ntaaacaagc ngctcaggca gctgaaaaaa gccactgata 60 
aagcatcctg gagtatcaga gtttactgtt agatcagcct catttgactt cccctcccac 120 
atggtgttta aatccagcta cactacttcc tgactcaaac tccactattc ctgttcatga 180 
ctgtcaggaa ctgttggaaa ctactgaaac tggccgacct gatcttcaaa atgtgcccct 240 
aggaaaggtg gatgccaccg tgttcacaga cagtaccncc ttcctcgaga agggactacg 300 
aggggccggt gcanctgtta ccaaggagac tnatgtgttg tgggctcagg ctttaccanc 360 
aaacacctca ncncnnaagg ctgaattgat cgccctcact caggctctcg gatggggtaa 420 
gggatattaa cgttaacact gacagcaggt acgcctttgc tactgtgcat gtacgtggag 480 
ccatctacca ggagcgtggg ctactcactc ggcaggtggc tgtnatccac tgtaaangga 540 
catcaaaagg aaaacnnggc tgttgcccgt ggtaaccana aanctgatcn ncagctcnaa 600 
gatgctgtgt tgactttcac tcncncctct taaacttgct gcccacantc tcctttccca 660 
accagatctg cctgacaatc cccatactca aaaaaaaaan aanactggcc ccgaacccna 720 
accaataaaa acggggangg tnggtnganc nncctgaccc aaaaataatg gatcccccgg 780 
gctgcaggaa ttcaattcan ccttatcnat acccccaacn nggngggggg ggccngtncc 840 
cattncccct ntattnattc tttnnccccc cccccggcnt cctttttnaa ctcgtgaaag 900 
ggaaaacctg ncttaccaan ttatcncctg gaccntcccc ttccncggtn gnttanaaaa 960 
aaaagcccnc antcccntcc naaatttgca cngaaaggna aggaatttaa cctttatttt 1020 
ttnntccttt antttgtnnn ccccctttta cccaggcgaa cngccatcnt ttaanaaaaa 1080 
aaanagaang tttatttttc cttngaacca tcccaatana aancacccgc nggggaacgg 1140 
ggnggnaggc cnctcacccc ctttntgtng gngggnc 1177 

<210> 9 

<211> 1146 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 1, 4, 5, 8, 9, 348, 706, 742, 745, 751, 758, 772, 793, 819, 
842, 846, 860, 866, 886, 889, 911, 939, 945, 955, 960, 982, 
999, 1002, 1005, 1009, 1010, 1033, 1047, 1049, 1055, 1058, 
1069, 1074, 1079, 1081, 1104, 1105, 1111, 1116, 1118 
<223> n = A,T,C or G 

<221> misc_feature 

<222> 1121, 1130, 1135, 1136, 1146 
<223> n = A,T,C or G 

<400> 9 

nccnnttnnt gatgttgtct ttttggcctc tctttggata ctttccctct cttcagaggt 60 
gaaaagggtc aaaaggagct gttgacagtc atcccaggtg ggccaatgtg tccagagtac 120 
agactccatc agtgaggtca aagcctgggg cttttcagag aagggaggat tatgggtttt 180 
ccaattatac aagtcagaag tagaaagaag ggacataaac caggaagggg gtggagcact 240 
catcacccag agggacttgt gcctctctca gtggtagtag aggggctact tcctcccacc 300 
acggttgcaa ccaagaggca atgggtgatg agcctacagg ggacatancc gaggagacat 360 
gggatgaccc taagggagta ggctggtttt aaggcggtgg gactgggtga gggaaactct 420 
cctcttcttc agagagaagc agtacagggc gagctgaacc ggctgaaggt cgaggcgaaa 480 
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acacggtctg gctcaggaag accttggaag taaaattatg aatggtgcat gaatggagcc 540 
atggaagggg tgctcctgac caaactcagc cattgatcaa tgttagggaa actgatcagg 600 
gaagccggga atttcattaa caacccgcca cacagcttga acattgtgag gttcagtgac 660 
ccttcaaggg gccactccac tccaactttg gccattctac tttgcnaaat ttccaaaact 720 
tcctttttta aggccgaatc cntantccct naaaaacnaa aaaaaatctg cncctattct 780 
ggaaaaggcc cancccttac caggctggaa gaaattttnc cttttttttt tttttgaagg 840 
cntttnttaa attgaacctn aattcncccc cccaaaaaaa aacccnccng gggggcggat 900 
ttccaaaaac naattccctt accaaaaaac aaaaacccnc ccttnttccc ttccnccctn 960 
ttcttttaat tagggagaga tnaagccccc caatttccng gnctngatnn gtttcccccc 1020 
cccccatttt ccnaaacttt ttcccancna ggaanccncc ctttttttng gtcngattna 1080 
ncaaccttcc aaaccatttt tccnnaaaaa ntttgntngg ngggaaaaan acctnntttt 1140 
atagan 1146 

<210> 10 

<211> 545 

<212> DNA 

<213> Homo sapiens 

<400> 10 

cttcattggg tacgggcccc ctcgaggtcg acggtatcga taagcttgat atcgaattcc 60 

tgcagcccgg gggatccact agttctagag tcaggaagaa ccaccaacct tcctgatttt 120 

tattggctct gagttctgag gccagttttc ttcttctgtt gagtatgcgg gattgtcagg 180 

cagatctggc tgtggaaagg agactgtggg cagcaagttt agaggcgtga ctgaaagtca 240 

cactgcatct tgagctgctg aatcagcttt ctggttacca cgggcaacag ccgtgttttc 300 

cttttgatgt cctttacagt ggattacagc cacctgctga ggtgagtagc ccacgctcct 360 

ggtagatggc tccacgtaca tgcacagtag caaaggcgta cctgctgtca gtgttaacgt 420 

taatatcctt accccatcgg agagcctgag tgagggcgat caattcagcc cttttgtgct 480 

gaggtgtttg ctggttaagc cctgaaccca caacacatct gtctccatgg taacagctgc 540 
accgg 545 

<210> 11 

<211> 196 

<212> DNA 

<213> Homo sapiens 

<400> 11 

tctcctaggc tgggcacagt ggctcatacc tgtaatcctg accgtttcag aggctcaggt 60 

ggggggatcg cttgagccca agatttcaag actagtctgg gtaacatagt gagaccctat 120 

ctctacgaaa aaataaaaaa atgagcctgg tgtagtggca cacaccagct gaggagggag 180 
aatcgagcct aggaga 196 

<210> 12 

<211> 388 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 82, 162 f 287 
<223> n = A,T,C or G 

<400> 12 

tctcctaggc ttgggggctc tgactagaaa ttcaaggaac ctgggattca agtccaactg 60 

tgacaccaac ttacactgtg gnctccaata aactgcttct ttcctattcc ctctctatta 120 

aataaaataa ggaaaacgat gtctgtgtat agccaagtca gntatcctaa aaggagatac 180 

taagtgacat taaatatcag aatgtaaaac ctgggaacca ggttcccagc ctgggattaa 240 

actgacagca agaagactga acagtactac tgtgaaaagc ccgaagnggc aatatgttca 300 

ctctaccgtt gaaggatggc tgggagaatg aatgctctgt cccccagtcc caagctcact 360 
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tactatacct cctttatagc ctaggaga 388 

<210> 13 

<211> 337 

<212> DNA 

<213> Homo sapiens 

<400> 13 

tagtagttgc ctataatcat gtttctcatt attttcacat tttattaacc aatttctgtt 60 

taccctgaaa aatatgaggg aaatatatga aacagggagg caatgttcag ataattgatc 120 

acaagatatg atttctacat cagatgctct ttcctttcct gtttatttcc tttttatttc 180 

ggttgtgggg tcgaatgtaa tagctttgtt tcaagagaga gttttggcag tttctgtagc 240 

ttctgacact gctcatgtct ccaggcatct atttgcactt taggaggtgt cgtgggagac 300 

tgagaggtct attttttcca tatttgggca actacta 337 

<210> 14 

<211> 571 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 435, 441, 451, 456, 462, 479, 488, 489, 509, 568 
<223> n = A,T,C or G 

<400> 14 

tagtagttgc catacagtgc ctttccattt atttaacccc cacctgaacg gcataaactg 60 

agtgttcagc tggtgttttt tactgtaaac aataaggaga ctttgctctt catttaaacc 120 

aaaatcatat ttcatatttt acgctcgagg gtttttaccg gttccttttt acactcctta 180 

aaacagtttt taagtcgttt ggaacaagat attttttctt tcctggcagc ttttaacatt 240 

atagcaaatt tgtgtctggg ggactgctgg tcactgtttc tcacagttgc aaatcaaggc 300 

atttgcaacc aagaaaaaaa aatttttttg ttttatttga aactggaccg gataaacggt 360 

gtttggagcg gctgctgtat atagttttaa atggtttatt gcacctcctt aagttgcact 420 

tatgtggggg ggggnttttg natagaaagt ntttantcac anagtcacag ggacttttnt 480 
cttttggnna ctgagctaaa aagggctgnt tttcgggtgg gggcagatga aggctcacag 540 

gaggcctttc tcttagaggg gggaactnct a 57 1 



<210> 15 
<211> 548 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_f eature 

<222> 224, 291, 326, 376, 388, 394, 428, 433, 507, 514 
<223> n = A,T,C or G 

<400> 15 

tatatattta ataacttaaa tatattttga tcacccactg gggtgataag acaatagata 60 
taaaagtatt tccaaaaagc ataaaaccaa agtatcatac caaaccaaat tcatactgct 
tcccccaccc gcactgaaac ttcaccttct aactgtctac ctaaccaaat tctacccttc 
aagtctttgg tgcgtgctca ctactctttt tttttttttt tttnttttgg agatggagtc 240 
tggctgtgca gcccaggggt ggagtacaat ggcacaacct cagctcactg naacctccgc 300 
ctcccaggtt catgagattc tcctgnttca gccttcccag tagctgggac tacaggtgtg 360 
catcaccatg cctggntaat cttttttngt tttngggtag agatgggggt tttacatgtt 420 
ggccaggntg gtntcgaact cctgacctca agtgatccac ccacctcagg ctcccaaagt 480 
gctaggatta cagacatgag ccactgngcc cagncctggt gcatgctcac ttctctaggc 540 
aactacta 



120 
180 



548 
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<210> 16 

<211> 638 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc — feature 
<222> 471, 488 
<223> n = A,T,C or G 

<400> 16 

ttccgttatg cacatgcaga atattctatc ggtacttcag ctattactca ttttgatggc 60 
gcaatccgag cctatcctca agatgagtat ttagaaagaa ttgatttagc gatagaccaa 120 
gctggtaagc actctgacta cacgaaattg ttcagatgtg atggatttat gacagttgat 180 
ctttggaaga gattattaag tgattatttt aaagggaatc cattaattcc agaatatctt 240 
ggtttagctc aagatgatat agaaatagaa cagaaagaga ctacaaatga agatgtatca 300 
ccaactgata ttgaagagcc tatagtagaa aatgaattag ctgcatttat tagccttaca 360 
catagcgatt ttcctgatga atcttatatt cagccatcga catagcatta cctgatgggc 420 
aaccttacga ataatagaaa ctgggtgcgg ggctattgat gaattcatcc ncagtaaatt 480 
tggatatnac aaaatataac tcgattgcat ttggatgatg gaatactaaa tctggcaaaa 540 
gtaactttgg agctactagt aacctctctt tttgagatgc aaaattttct tttagggttt 600 
cttattctct actttacgga tattggagca taacggga 638 

<210> 17 

<211> 286 

<212> DNA 

<213> Homo sapiens 

<400> 17 

actgatggat gtcgccggag gcgaggggcc ttatctgatg ctcggctgcc tgttcgtgat 60 
gtgcgcggcg attgggctgt ttatctcaaa caccgccacg gcggtgctga tggcgcctat 120 
tgccttagcg gcggcgaagt caatgggcgt ctcaccctat ccttttgcca tggtggtggc 180 
gatggcggct tcggcggcgt ttatgacccc ggtctcctcg ccggttaaca ccctggtgct 240 
tggccctggc aagtactcat ttagcgattt tgtcaaaata ggcgtg 286 

<210> 18 

<211> 262 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 184, 234, 240 
<223> n » A,T,C or G 

<400> 18 

tcggtcatag cagccccttc ttctcaattt catctgtcac taccctggtg tagtatctca 60 

tagccttaca tttttatagc ctcctccctg gtctgtcttt tgattttcct gcctgtaatc 120 

catatcacac ataactgcaa gtaaacattt ctaaagtgtg gttatgctca tgtcactcct 180 

gtgncaagaa atagtttcca ttaccgtctt aataaaattc ggatttgttc tttnctattn 240 
tcactcttca cctatgaccg aa 262 

<210> 19 

<211> 261 

<212> DNA 

<213> Homo sapiens 
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<400> 19 

tcggtcatag caaagccagt ggtttgagct 
atttatgata aatggtggca ggatttttat 
aactctgaga tatattcttc tacatttaaa 
atttgcgtag ttaggtaaga gtgtttaatg 
cgtttctctg cctatgaccg a 



ctctactgtg taaactccta aaccaaggcc 60 
tataaacatg tacccatgca aatttcctat 120 
caataaaaat aatctatttt taaaagccta 180 
agagggtata aggtataaat caccagtcaa 240 



<210> 20 

<211> 294 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_f eature 
<222> 194, 274, 283, 294 
<223> n = A,T,C or G 



<400> 20 £A 
tacaacgagg cgacgtcggt aaaatcggac atgaagccac cgctggtctt ttcgtccgag 60 
cgataggcgc cggccagcca gcggaacggt tgcccggatg gcgaagcgag ccggagttct 120 
tcggactgag tatgaatctt gttgtgaaaa tactcgccgc cttcgttcga cgacgtcgcg 180 
tcgaaatctt cganctcctt acgatcgaag tcttcgtggg cgacgatcgc ggtcagttcc 240 
gccccaccga aatcatggtt gagccggatg ctgnccccga agncctcgtt tgtn 294 

<210> 21 

<211> 208 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<222> 116, 132, 140, 160, 164, 191, 197, 199 
<223> n = A,T,C or G 

<400> 21 

ttggtaaagg gcatggacgc agacgcctga cgtttggctg aaaatctttc attgattcgt 60 
atcaatgaat aggaaaattc ccaaagaggg aatgtcctgt tgctcgccag tttttntgtt 120 
gttctcatgg anaaggcaan gagctcttca gactattggn attntcgttc ggtcttctgc 180 
caactagtcg ncttgcnang atcttcat 



<210> 22 

<211> 287 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_f eature 

<222> 1, 4, 25, 121, 168, 207, 212 
<223> n - A,T,C or G 



<400> 22 

nccnttgagc tgagtgattg agatntgtaa 
gtggcgggtc acccggcagt gggtctcccg 
ngtgcgcatc gctcgactat atgctatggc 
ggcgctggag ctttccacgg tccatgnatt 
gcgtgatggt acgctggctg gagcattgat 



tggttgtaag ggtgattcag gcggattagg 60 
acaggccagc aggatttggg gcaggtacgg 120 
aggcgagccg tggaaggngg atcaggtcac 180 
gngatggctg ttctaggcgg ctgttgccaa 240 
ttctggtgcc aaggtgg 28 7 



<210> 23 
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<211> 204 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<222> 40, 121, 131, 162, 184, 197 

<223> n = A,T,C or G 

<400> 23 

ttgggtaaag ggagcaagga gaaggcatgg agaggctcan gctggtcctg gcctacgact 60 
gggccaagct gtcgccgggg atggtggaga actgaagcgg gacctcctcg aggtcctccg 120 
ncgttacttc nccgtccagg aggagggtct ttccgtggtc tnggaggagc ggggggagaa 180 
gatnctcctc atggtcnaca tccc 204 

<210> 24 
<211> 264 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<222> 171, 206 
<223> n - A,T,C or G 

<400> 24 

tggattggtc aggagcgggt agagtggcac cattgagggg atattcaaaa atattatttt 60 
gtcctaaatg atagttgctg agtttttctt tgacccatga gttatattgg agtttatttt 120 
ttaactttcc aatcgcatgg acatgttaga cttattttct gttaatgatt nctattttta 180 
ttaaattgga tttgagaaat tggttnttat tatatcaatt tttggtattt gttgagtttg 240 
acattatagc ttagtatgtg acca 264 

<210> 25 

<211> 376 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 103, 111, 192, 196, 199, 220, 224, 230, 251, 268, 283, 317, 

352, 370, 374 

<223> n = A,T,C or G 

<400> 25 

ttacaacgag gggaaactcc gtctctacaa aaattaaaaa attagccagg tgtggtggtg 60 
tgcacccgca atcccagcta cttgggaggt tgagacacaa gantcaccta natgtgggag 120 
gtcaaggttg catgagtcat gattgtgcca ctgcactcca gcctgggtga cagaccgaga 180 
ccctgcctca anaganaang aataggaagt tcagaaatcn tggntgtggn gcccagcaat 240 
ctgcatctat ncaacccctg caggcaangc tgatgcagcc tangttcaag agctgctgtt 300 
tctggaggca gcagttnggg cttccatcca gtatcacggc cacactcgca cnagccatct 360 
gtcctccgtn tgtnac 376 

<210> 26 

<211> 372 

<212> DNA 

<213> Homo sapiens 



<220> 
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<221> misc_feature 
<222> 231 f 312, 340 
<223> n = A,T,C or G 

<400> 26 

ttacaacgag gggaaactcc gtctctacaa aaattaaaaa attagccagg tgtggtggtg 60 
tgcacctgta atcccagcta cttgggcggc tgagacacaa gaaccaccta aatgtgggag 120 
ggtcaaggtt gcatgagtca tgatcgcgcc actgcactcc agcctgggtg acagactgag 180 
accctgcctc aaaagaaaaa gaataggaag ttcagaaacc ctgggtgtgg ngcccagcaa 240 
tctgcattta aacaatccct gcaggcaatg ctgatgcagc ctaagttcaa gagctgctgt 300 
tctggaggca gnagtaaggg cttccatcca gcatcacggn caacactgca aaagcacctg 360 
tcctcgttgg ta 372 

<210> 27 

<211> 477 

<212> DNA 

<213> Homo sapiens 

<400> 27 

ttctgtccac atctacaagt tttatttatt ttgtgggttt tcagggtgac taagtttttc 60 

cctacattga aaagagaagt tgctaaaagg tgcacaggaa atcatttttt taagtgaata 120 

tgataatatg ggtccgtgct taatacaact gagacatatt tgttctctgt ttttttagag 180 

tcacctctta aagtccaatc ccacaatggt gaaaaaaaaa tagaaagtat ttgttctacc 240 

tttaaggaga ctgcagggat tctccttgaa aacggagtat ggaatcaatc ttaaataaat 300 

atgaaattgg ttggtcttct gggataagaa attcccaact cagtgtgctg aaattcacct 360 

gacttttttt gggaaaaaat agtcgaaaat gtcaatttgg tccataaaat acatgttact 420 

attaaaagat atttaaagac aaattctttc agagctctaa gattggtgtg gacagaa 477 

<210> 28 

<211> 438 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 4, 16, 30, 255, 413 

<223> n = A,T,C or G 

<400> 28 

tctncaacct cttgantgtc aaaaaccttn taggctatct ctaaaagctg actggtattc 60 
attccagcaa aatccctcta gtttttggag tttcctttta ctatctgggg ctgcctgagc 120 
cacaaatgcc aaattaagag catggctatt ttcgggggct gacaggtcaa aaggggtgta 180 
aatccgataa gcctcctgga ggtgctctaa aaacactcct ggtgactcat catgcccctg 240 
gacgacttca atcgncttag acaagtttat aggtttctgg gcagctccct gaatacccac 300 
gaggagatac cggtggaaat cgtcaaaagt tctccctcca cttgagaaat ttgggtccca 360 
attaggtccc aattgggtct ctaatcacta ttcctctagc ttcctcctcc ggnctattgg 420 



ttgatgtgag gttgaaga 

<210> 29 

<211> 620 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 391, 481, 483, 490, 497, 510, 527, 532, 540, 545, 593, 612 

<223> n = A,T,C or G 



438 



WO0301 3431 fht tp : //wv vw.qetthepatent.com/Login.doq/$patents4/@21 01 2 1 %2E419C1 2/Fetch/WO0301 3431 .cp c? fro mCache= 1 part =m ai ntool b ar ==i : F a qe 1 81 of 308 



WO 03/013431 PCT/US02/24917 

13 



<400> 29 

aagagggtac cagccccaag ccttgacaac ttccataggg tgtcaagcct gtgggtgcac 60 

agaagtcaaa aattgagttt tgggatcctc agcctagatt tcagaggata taaagaaaca 120 

cctaacacct agatattcag acaaaagttt actacaggga tgaagctttc acggaaaacc 180 

tctactagga aagtacagaa gagaaatgtg ggtttggagc ccccaaacag aatcccctct 240 

agaacactgc ctaatgaaac tgtgagaaga tggccactgt catccagaca ccagaatgat 300 

agacc caeca aaaacttatg ccatattgcc tataaaacct acagacactc aatgccagcc 360 

ccatgaaaaa aaaactgaga agaagactgt nccctacaat gccaccggag cagaactgcc 420 

ccaggccatg gaagcacagc tcttatatca atgtgacctg gatgttgaga catggaatcc 480 

nangaaatcn ttttaanact tccacggttn aatgactgee ctattanatt cngaacttan 540 

atccnggcct gtgacctctt tgctttggcc attccccctt tttggaatgg ctnttttttt 600 

cccatgcctg tnccctctta 620 

<210> 30 

<211> 100 

<212> DNA 

<213> Homo sapiens 

<400> 30 

ttacaacgag ggggtcaatg tcataaatgt cacaataaaa caatctcttc tttttttttt 60 
tttttttttt tttttttttt tttttttttt tttttttttt 100 

<210> 31 

<211> 762 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 626, 652, 662, 715, 736 

<223> n - A,T f C or G 

<400> 31 

tagtctatgc geeggacaga gcagaattaa attggaagtt gccctccgga ctttctaccc 60 

acactcttcc tgaaaagaga aagaaaagag gcaggaaaga ggttaggatt tcattttcaa 120 

gagtcagcta attaggagag cagagtttag acagcagtag gcaccccatg atacaaacca 180 

tggacaaagt ccctgtttag taactgccag acatgatcct gctcaggttt tgaaatctct 240 

ctgcccataa aagatggaga gcaggagtgc catccacatc aacacgtgtc caagaaagag 300 

tctcagggag acaagggtat caaaaaacaa gattcttaat gggaaggaaa tcaaaccaaa 360 

aaattagatt tttctctaca tatatataat atacagatat ttaacacatt attccagagg 420 

tggctccagt ccttggggct tgagagatgg tgaaaacttt tgttccacat taacttctgc 480 

tctcaaattc tgaagtatat cagaatggga caggcaatgt tttgctccac actggggcac 540 

agacccaaat ggttctgtgc ccgaagaaga gaagcccgaa agacatgaag gatgettaag 600 

gggggttggg aaagccaaat tggtantatc ttttcctcct gcctgtgttc cngaagtctc 660 

enctgaagga attcttaaaa ccctttgtga ggaaatgece ccttaccatg acaantggtc 720 

ecattgettt tagggngatg gaaacaccaa gggttttgat cc 762 

<210> 32 

<211> 276 

<212> DNA 

<213> Homo sapiens 

<400> 32 

tagtctatgc gtgtattaac ctcccctccc tcagtaacaa ccaaagaggc aggagctgtt 60 

attaccaacc ccattttaca gatgeatcaa taatgacaga gaagtgaagt gaettgegea 120 

cacaaccagt aaattggcag agtcagattt gaatccatgg agtctggtct gcactttcaa 180 

tcaccgaata ccctttctaa gaaacgtgtg ctgaatgagt gcatggataa atcagtgtct 240 

actcaacatc tttgectaga tatcccgcat agacta 276 
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<210> 33 

<211> 477 

<212> DNA 

<213> Homo sapiens 

<400> 33 

tagtagttgc caaatatttg aaaatttacc cagaagtgat tgaaaacttt ttggaaacaa 60 
aaacaaataa agccaaaagg taaaataaaa atatctttgc actctcgtta ttacctatcc 120 
ataacttttt caccgtaagc tctcctgctt gttagtgtag tgtggttata ttaaactttt 180 
tagttattat tttttattca cttttccact agaaagtcat tattgattta gcacacatgt 240 
tgatctcatt tcattttttc tttttatagg caaaatttga tgctatgcaa caaaaatact 300 
caagcccatt atcttttttc cccccgaaat ctgaaaattg caggggacag agggaagtta 360 
tcccattaaa aaattgtaaa tatgttcagt ttatgtttaa aaatgcacaa aacataagaa 420 
aattgtgttt acttgagctg ctgattgtaa gcagttttat ctcaggggca actacta 4 77 

<210> 34 

<211> 631 

<212> DNA 

<213> Homo sapiens 

<400> 34 

tagtagttgc caattcagat gatcagaaat gctgctttcc tcagcattgt cttgttaaac 60 
cgcatgccat ttggaacttt ggcagtgaga agccaaaagg aagaggtgaa tgacatatat 120 
atatatatat attcaatgaa agtaaaatgt atatgctcat atactttcta gttatcagaa 180 
tgagttaagc tttatgccat tgggctgctg catattttaa tcagaagata aaagaaaatc 240 
tgggcatttt tagaatgtga tacatgtttt tttaaaactg ttaaatatta tttcgatatt 300 
tgtctaagaa ccggaatgtt cttaaaattt actaaaacag tattgtttga ggaagagaaa 360 
actgtactgt ttgccattat tacagtcgta caagtgcatg tcaagtcacc cactctctca 420 
ggcatcagta tccacctcat agctttacac attttgacgg ggaatattgc agcatcctca 480 
ggcctgacat ctgggaaagg ctcagatcca cctactgctc cttgctcgtt gatttgtttt 540 
aaaatattgt gcctggtgtc acttttaagc cacagccctg cctaaaagcc agcagagaac 600 
agaacccgca ccattctata ggcaactact a 631 

<210> 35 

<211> 578 

<212> DNA 

<213> Homo sapiens 

<400> 35 

tagtagttgc catcccatat tacagaaggc tctgtataca tgacttattt ggaagtgatc 60 
tgttttctct ccaaacccat ttatcgtaat ttcaccagtc ttggatcaat cttggtttcc 120 
actgatacca tgaaacctac ttggagcaga cattgcacag ttttctgtgg taaaaactaa 180 
aggtttattt gctaagctgt catcttatgc ttagtatttt ttttttacag tggggaattg 240 
ctgagattac attttgttat tcattagata ctttgggata acttgacact gtcttctttt 300 
tttcgctttt aattgctatc atcatgcttt tgaaacaaga acacattagt cctcaagtat 360 
tacataagct tgcttgttac gcctggtggt ttaaaggact atctttggcc tcaggttcac 420 
aagaatgggc aaagtgtttc cttatgttct gtagttctca ataaaagatt gccaggggcc 480 
gggtactgtg gctcgcactg taatcccagc actttgggaa gctgaggctg gcggatcatg 540 
ttagggcagg tgttcgaaac cagcctgggc aactacta 578 

<210> 36 

<211> 583 

<212> DNA 

<213> Homo sapiens 



<400> 36 

tagtagttgc ctgtaatccc agcaactcag gaggctgggg caggagaatc agttgaacct 60 
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gggaggcaga agttgtaatt agcaaagatc gcaccattgc acttcagcct gggcaacaag 120 
agtgagattc catctcaaaa acaaaaaaaa gaaaaagaaa agaaaaggaa aaaacgtata 180 
aacccagcca aaacaaaatg atcattcttt taataagcaa gactaattta atgtgtttat 240 
ttaatcaaag cagttgaatc ttctgagtta ttggtgaaaa tacccatgta gttaatttag 300 
ggttcttact tgggtgaacg tttgatgttc acaggttata aaatggttaa caaggaaaat 360 
gatgcataaa gaatcttata aactactaaa aataaataaa atataaatgg ataggtgcta 420 
tggatggagt ttttgtgtaa tttaaaatct tgaagtcatt ttggatgctc attggttgtc 480 
tggtaatttc cattaggaaa aggttatgat atggggaaac tgtttctgga aattgcggaa 540 
tgtttctcat ctgtaaaatg ctagtatctc agggcaacta eta 583 

<210> 37 

<211> 716 

<212> DMA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 15, 669, 673, 678, 686, 704 

<223> n = A,T,C or G 

<400> 37 

gatctactag tcatntggat tctatccatg gcagctaagc ctttctgaat ggattctact 60 

gctttcttgt tctttaatcc agacccttat atatgtttat gttcacaggc agggcaatgt 120 

ttagtgaaaa caattctaaa ttttttattt tgeattttea tgetaattte cgtcacactc 180 

cagcaggctt cctgggagaa taaggagaaa tacagctaaa gacattgtcc ctgettaett 240 

acagectaat ggtatgcaaa accacttcaa taaagtaaca ggaaaagtac taaccaggta 300 

gaatggacca aaactgatat agaaaaatca gaggaagaga ggaacaaata tttactgagt 360 

cctagaatgt acaaggcttt ttaattacat attttatgta aggectgeaa aaaacaggtg 420 

agtaatcaac atttgtccca ttttacatat aaggaaactg aagcttaaat tgaataattt 480 

aatgeataga ttttatagtt agaccatgtt caggtcccta tgttatactt actagctgta 540 

tgaatatgag aaaataattt tgttattttc ttggcatcag tattttcatc tgcaaaataa 600 

agctaaagtt atttagcaaa cagtcagcat agtgcctgat acatagtagg tgctccaaac 660 

atgattacne tantattngg tattanaaaa atccaatata ggcntggata aaaccg 716 

<210> 38 

<211> 688 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 260 

<223> n = A,T,C or G 
<400> 38 

ttctgtccac atatcatccc actttaattg ttaatcagca aaactttcaa tgaaaaatca 60 
tccattttaa ccaggatcac accaggaaac tgaaggtgta ttttttttta ccttaaaaaa 120 
aaaaaaaaaa accaaacaaa ccaaaacaga ttaacagcaa agagttctaa aaaatttaca 180 
tttctcttac aactgtcatt cagagaacaa tagttcttaa gtctgttaaa tcttggcatt 240 
aacagagaaa cttgatgaan agttgtactt ggaatattgt ggattttttt ttttgtctaa 300 
tctcccccta ttgttttgcc aacagtaatt taagtttgtg tggaacatcc ccgtagttga 360 
agtgtaaaca atgtatagga aggaatatat gataagatga tgcatcacat atgeattaca 420 
tgtagggacc ttcacaactt catgcactca gaaaacatgc ttgaagagga ggagaggacg 480 
gcccagggtc accatccagg tgecttgagg acagagaatg cagaagtggc actgttgaaa 540 
tttagaagac catgtgtgaa tggtttcagg cctgggatgt ttgccaccaa gaagtgcctc 600 
cgagaaattt ctttcccatt tggaatacag ggtggcttga tgggtacggt gggtgaccca 660 
acgaagaaaa tgaaattctg ccctttcc 688 
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<210> 39 

<211> 585 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 14, 15, 24, 53, 108, 135, 465, 477, 495, 499, 504, 517, 530, 
580, 581 

<223> n = A,T,C or G 
<400> 39 

tagtagttgc cgcnnaceta aaanttggaa agcatgatgt ctaggaaaca tantaaaata 60 
gggtatgcct atgtgctaca gagagatgtt agcatttaaa gtgcatantt ttatgtattt 120 
tgacaaatgc atatncctct ataatccaca actgattacg aagctattac aattaaaaag 180 
tttggccggg cgtggtgggc ggtggctgac gcctgtaatc ccagcacttt gggaggccga 240 
ggcacgcgga tcacgaggtc gggagttcaa gaccatcctg gctaacacgg tgaaagtcca 300 
tctctactaa aaatacgaaa aaattacccc ggcgtggtgg cgggcgcctg tagtcccagc 360 
tactccggag gctgaggcag gagaatggcg tgaacccagg acacggagct tgcagtgtgc 420 
caacatcacg tcactgccct ccagcctggg ggacaggaac aagantcccg tcctcanaaa 480 
agaaaaatac tactnatant ttcnacttta ttttaantta cacagaactn cctcttggta 540 
cccccttacc attcatctca cccacctcct atagggcacn nctaa 585 

<210> 40 

<211> 475 

<212> DNA 

<213> Homo sapiens 

<400> 40 

tctgtccaca ccaatcttag aagctctgaa aagaatttgt ctttaaatat cttttaatag 60 

taacatgtat tttatggacc aaattgacat tttcgactgt tttttccaaa aaagtcaggt 120 

gaatttcagc acactgagtt gggaatttct tatcccagaa gaccaaccaa tttcatattt 180 

atttaagatt gattccatac tccgttttca aggagaatcc ctgcagtctc cttaaaggta 240 

gaacaaatac ttcctatttt tttttcacca ttgtgggatt ggactttaag aggtgactct 300 

aaaaaaacag agaacaaata tgtctcagtt gtattaagca cggacccata ttatcatatt 360 

cacttaaaaa aatgatttcc tgtgcacctt ttggcaactt ctcttttcaa tgtagggaaa 420 

aacttagtca ccctgaaaac ccacaaaata aataaaactt gtagatgtgg acaga 475 

<210> 41 

<211> 423 

<212> DNA 

<213> Homo sapiens 

<400> 41 

taagagggta catcgggtaa gaacgtaggc acatctagag cttagagaag tctggggtag 60 

gaaaaaaatc taagtattta taagggtata ggtaacattt aaaagtaggg ctagctgaca 120 

ttatttagaa agaacacata cggagagata agggcaaagg actaagacca gaggaacact 180 

aatatttagt gatcacttcc attcttggta aaaatagtaa cttttaagtt agcttcaagg 240 

aagatttttg gccatgatta gttgtcaaaa gttagttctc ttgggtttat attactaatt 300 

ttgttttaag atccttgtta gtgctttaat aaagtcatgt tatatcaaac gctctaaaac 360 

attgtagcat gttaaatgtc acaatatact taccatttgt tgtatatggc tgtaccctct 420 

eta 423 

<210> 42 

<211> 527 

<212> DNA 

<213> Homo sapiens 
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<220> 

<221> misc_f eature 
<222> 470, 475, 515, 522 
<223> n = A,T,C or G 

<400> 42 

tctcctaggc taatgtgtgt gtttctgtaa aagtaaaaag ttaaaaattt taaaaataga 60 
aaaaagctta tagaataaga atatgaagaa agaaaatatt tttgtacatt tgcacaatga 120 
gtttatgttt taagctaagt gttattacaa aagagccaaa aaggttttaa aaattaaaac 180 
gtttgtaaag ttacagtacc cttatgttaa tttataattg aagaaagaaa aacttttttt 240 
tataaatgta gtgtagccta agcatacagt atttataaag tctggcagtg ttcaataatg 300 
tcctaggcct tcacattcac tcactgactc acccagagca acttccagtc ctgtaagctc 360 
cattcgtggt aagtgcccta tacaggtgca ccatttattt tacagtattt ttactgtacc 420 
ttctctatgt ttccatatgt ttcgatatac aaataccact ggttactatn gcccnacagg 480 
taattccagt aacacggcct gtatacgtct ggtancccta gngaaga 527 

<210> 43 
<211> 331 
<212> DNA 

<213> Homo sapiens 
<400> 43 

tcttcaacct cgtaggacaa ctctcatatg cctgggcact atttttaggt tactaccttg 60 
gctgcccttc tttaagaaaa aaaaaagaag aaaaaagaac ttttccacaa gtttctcttc 120 
ctctagttgg aaaattagag aaatcatgtt tttaattttg tgttatttca gatcacaaat 180 
tcaaacactt gtaaacatta agcttctgtt caatcccctg ggaagaggat tcattctgat 240 
atttacggtt caaaagaagt tgtaatattg tgcttggaac acagagaacc agttattaac 300 
ttcctactac tattatataa taaataataa c 331 

<210> 44 
<211> 592 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 
<222> 473 

<223> n = A,T,C or G 
<400> 44 

ggcttagtag ttgccaggca aaatarcgtt gattctcctc aggagccacc cccaacaccc 60 
ctgtttgctt ctagacctat acctagacta aagtcccagc agacccctag aggtgaggtt 120 
cagagtgacc cttgaggaga tgtgctacac tagaaaagaa ctgcttgagt tttctaattt 180 
atataagcag aaatctggag aagagtcata ggaatggata ttaagggtgt gagataatgg 240 
cggaaggaat atagagttgg atcaggctgg acttattgat ttgaacccac taagtagaga 300 
ttctgctttt gatgttgcag ctcagggagt taaaaaaggt tttaatggtt ctaatagttt 360 
atttgcttgg ttagctgaaa tatggataaa agatggccca ctgtgagcaa gctggaaatg 420 
cctgatctct ctcagtttaa tgtagaggaa gggatccaaa agtttaggga ganttggatg 480 
ctggraktgg attggtcact ttgrgaccta cccwtcccag ctgggagggt ccagaagata 540 
cacccttgac caacgctttg cgaaatggat ttgtgatggc ggcaactact aa 592 

<210> 45 
<211> 567 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc feature 
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<222> 522, 561, 566 
<223> n = A,T,C or G 

<400> 45 

ggcttagtag ttgccattgc gagtgcttgc tcaacgagcg ttgaacatgg cggattgtct 60 

agattcaacg gatttgagtt ttaccagcaa agcgaaccaa gcgcggccca gagaattatg 120 

ggttggttgg ctttgaaaag atggaaatcc tgtaggccta gtcagaaaag ccttcttgca 180 

gaacagttgg ttctcgggcg aacgctcatc aagatgccca ttggaaaggc tagcgtgtat 240 

ttgggagagc ctgatagcgt gtcttctgat gatgtttgtg cttggacagt gacaaaagat 300 

atgcaaagca agtccgaact agacgtcaag cttcgtgagc aaattattgt agactcctac 360 

ttatactgtg aggaatgata gccaagggtg gggactttaa gactaaggtg gtttgtactt 420 

gcgccgatga tcccaggcag aaagamctga tcgctagttt tatacgggca actactaagc 480 
cgaattccag cacactggcg gccgttacta attggatccg anctcggtac cagcttgatg 540 

catascttga gttwtctata ntgtcnc 567 

<210> 46 

<211> 908 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 21, 23, 24, 27, 29, 34 

<223> n - A,T,C or G 

<400> 46 

gagcgaaaga ccgagggcag ngnntangng cgangaagcg gagagggcca aaaagcaacc 60 
gctttccccg gggggtgccg attcattaag gcaggtggag gacaggtttc ccgatggaag 120 
gcggcagggg cgcaagcaat taatgtgagt aggccattca ttagcacccg ggcttaacat 180 
ttaagcttcg ggttggtatg tggtgggaat tgtgagcgga taacaatttc acacaggaaa 240 
cagctatgac catgattacg ccaagctatt taggtgacat tatagaataa ctcaagttat 300 
gcatcaagct tggtaccgag ttcggatcca ctagtaacgg ccgccagtgt gtggaattcg 360 
gcttagtagt tgccgaccat ggagtgctac ctaggctaga atacctgagy tcctccctag 420 
cctcactcac attaaattgt atcttttcta cattagatgt cctcagcgcc ttatttctgc 480 
tggacwatcg ataaattaat cctgatagga tgatagcagc agattaatta ctgagagtat 540 
gttaatgtgt catccctcct atataacgta tttgcatttt aatggagcaa ttctggagat 600 
aatccctgaa ggcaaaggaa tgaatcttga gggtgagaaa gccagaatca gtgtccagct 660 
gcagttgtgg gagaaggtga tattatgtat gtctcagaag tgacaccata tgggcaacta 720 
ctaagcccga attccagcac actggcgggc gttactaatg gatccgagct cggtaccaag 780 
cttgatgcat agcttgagta tctatagtgt cactaaatag cctggcgtta tcatggtcat 840 
agctgtttcc tgtgtgaaat tgttatccgc tcccaattcc ccccaccata cgagccggaa 900 
cataaagt 908 



<210> 47 

<211> 480 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 408, 461 

<223> n = A,T,C or G 



<400> 47 

tgccaacaag gaaagtttta aatttcccct tgaggattct tggtgatcat caaattcagt 60 

ggtttttaag gttgttttct gtcaaataac tctaacttta agccaaacag tatatggaag 120 

cacagataka atattacaca gataaaagag gagttgatct aaagtaraga tagttggggg 180 

ctttaatttc tggaacctag gtctccccat cttcttctgt gctgaggaac ttcttggaag 240 
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cggggattct aaagttcttt ggaagacagt ttgaaaacca ccatgttgtt ctcagtacct 300 
ttatttttaa aaagtaggtg aacattttga gagagaaaag ggcttggttg agatgaagtc 360 
cccccccccc cttttttttt ttttagctga aatagatacc ctatgttnaa rgaarggatt 420 
attatttacc atgccaytar scacatgctc tttgatgggc nyctccstac cctccttaag 480 



<210> 48 

<211> 591 

<212> DNA 

<213> Homo sapiens 

<400> 48 

aagagggtac cgagtggaat ttccgcttca ctagtctggt gtggctagtc ggtttcgtgg 60 
tggccaacat tacgaacttc caactcaacc gttcttggac gttcaagcgg gagtaccggc 120 
gaggatggtg gcgtgaattc tggcctttct ttgccgtggg atcggtagcc gccatcatcg 180 
gtatgtttat caagatcttc tttactaacc cgacctctcc gatttacctg cccgagccgt 240 
ggtttaacga ggggaggggg atccagtcac gcgagtactg gtcccagatc ttcgccatcg 300 
tcgtgacaat gcctatcaac ttcgtcgtca ataagttgtg gaccttccga acggtgaagc 360 
actccgaaaa cgtccggtgg ctgctgtgcg gtgactccca aaatcttgat aacaacaagg 420 
taaccgaatc gcgctaagga accccggcat ctcgggtact ctgcatatgc gtacccctta 480 
agccgaattc cagcacactg gcggccgtta ctaattggat ccgaactccg taaccaagcc 540 
tgatgcgtaa cttgagttat tctatagtgt ccctaaaata acctggcgtt a 591 

<210> 49 

<211> 454 

<212> DNA 

<213> Homo sapiens 

<400> 49 

aagagggtac ctgccttgaa atttaaatgt ctaaggaaar tgggagatga ttaagagttg 60 

gtgtggcyta gtcacaccaa aatgtattta ttacatcctg ctcctttcta gttgacagga 120 

aagaaagctg ctgtggggaa aggagggata aatactgaag ggatttacta aacaaatgtc 180 

catcacagag ttttcctttt tttttttttg agacagagtc ttgctctgtc acccaggctg 240 

gaatgaagwg gtatgatctc agttgaatgc aacctctacc tcctaggttc aagcgattct 300 

catgcctcag cctcctgagc agctgggact ataggcgcat gctaccatgc caggctaatt 360 

tttatatttt tattagagac ggggtgttgc catgttggcc aggcaggtct cgaactcctg 420 
ggcctcagat gatctgcccc accgtaccct ctta 454 

<210> 50 

<211> 463 

<212> DNA 

<213> Homo sapiens 

<400> 50 

aagagggtac caaaaaaaag aaaaaggaaa aaaagaaaaa caacttgtat aaggctttct 60 
gctgcataca gctttttttt tttaaataaa tggtgccaac aaatgttttt gcattcacac 120 
caattgctgg ttttgaaatc gtactcttca aaggtatttg tgcagatcaa tccaatagtg 180 
atgccccgta ggttttgtgg actgcccacg ttgtctacct tctcatgtag gagccattga 240 
gagactgttt ggacatgcct gtgttcatgt agccgtgatg tccgggggcc gtgtacatca 300 
tgttaccgtg gggtggggtc tgcattggct gctgggcata tggctgggtg cccatcatgc 360 
ccatctgcat ctgcataggg tattggggcg tttgatccat atagccatga ttgctgtggt 420 
agccactgtt catcattggc tgggacatgc tgttaccctc tta 463 

<210> 51 

<211> 399 

<212> DNA 

<213> Homo sapiens 
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<400> 51 

cttcaacctc ccaaagtgct gggattacag gactgagcca ccacgctcag cctaagcctc 60 
tttttcacta ccctctaagc gatctaccac agtgatgagg ggctaaagag cagtgcaatt 120 
tgattacaat aatggaactt agatttatta attaacaatt tttccttagc atgttggttc 180 
cataattatt aagagtatgg acttacttag aaatgagctt tcattttaag aatttcatct 240 
ttgaccttct ctattagtct gagcagtatg acactatacg tattttattt aactaaccta 300 
ccttgagcta ttacttttta aaaggctata tacatgaatg tgtattgtca actgtaaagc 360 
cccacagtat ttaattatat catgatgtct ttgaggttg 399 

<210> 52 

<211> 392 

<212> DNA 

<213> Homo sapiens 

<400> 52 

cttcaacctc aatcaacctt ggtaattgat aaaatcatca cttaactttc tgatataatg 60 

gcaataatta tctgagaaaa aaaagtggtg aaagattaaa cttgcatttc tctcagaatc 120 

ttgaaggata tttgaataat tcaaaagcgg aatcagtagt atcagccgaa gaaactcact 180 

tagctagaac gttggaccca tggatctaag tccctgccct tccactaacc agctgattgg 240 

ttttgtgtaa acctcctaca cgcttgggct tggtcgcctc atttgtcaaa gtaaaggctg 300 

aaataggaag ataatgaacc gtgtcttttt ggtctctttt ccatccatta ctctgatttt 360 

acaaagaggc ctgtattccc ctggtgaggt tg 392 

<210> 53 

<211> 179 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 135, 143, 179 
<223> n - A,T,C or G 

<400> 53 

ttcgggtgat gcctcctcag gctacagtga agactggatt acagaaaggt gccagcgaga 60 

tttcagattc ctgtaaacct ctaaagaaaa ggagtcgcgc ctcaactgat gtagaaatga 120 

ctagttcagc atacngagac acntctgact ccgattctag aggactgagt gacctgcan 179 

<210> 54 

<211> 112 

<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<222> 31, 49, 54, 55, 75, 91, 107 

<223> n = A,T,C or G 

<400> 54 

ttcgggtgat gcctcctcag gctacatcat natagaagca aagtagaana atcnngtttg 60 
tgcattttcc cacanacaaa attcaaatga ntggaagaaa ttggganagt at 112 

<210> 55 

<211> 225 

<212> DNA 

<213> Homo sapiens 



<400> 55 
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tgagcttccg cttctgacaa ctcaatagat aatcaaagga caactttaac agggattcac 60 

aaaggagtat atccaaatgc caataaacat ataaaaagga attcagcttc atcatcatca 120 

gaagwatgca aattaaaacc ataatgagaa accactatgt cccactagaa tagataaaat 180 
cttaaaagac tggtaaaacc aagtgttggt aaggcaagag gagca 225 

<210> 56 

<211> 175 

<212> DNA 

<213> Homo sapiens 

<400> 56 

gctcctcttg ccttaccaac acattctcaa aaacctgtta gagtcctaag cattctcctg 60 

ttagtattgg gattttaccc ctgtcctata aagatgttat gtaccaaaaa tgaagtggag 120 

ggccataccc tgagggaggg gagggatctc tagtgttgtc agaagcggaa gctca 175 

<210> 57 

<211> 223 

<212> DNA 

<213> Homo sapiens 

<400> 57 

agccatttac cacccatgga tgaatggatt ttgtaattct agctgttgta ttttgtgaat 60 
ttgttaattt tgttgttttt ctgtgaaaca catacattgg atatgggagg taaaggagtg 120 
tcccagttgc tcctggtcac tccctttata gccattactg tcttgtttct tgtaactcag 180 
gttaggtttt ggtctctctt gctccactgc aaaaaaaaaa aaa 223 

<210> 58 

<211> 211 

<212> DNA 

<213> Homo sapiens 

<400> 58 

gttcgaaggt gaacgtgtag gtagcggatc tcacaactgg ggaactgtca aagacgaatt 60 

aactgacttg gatcaatcaa atgtgactga ggaaacacct gaaggtgaag aacatcatcc 120 

agtggcagac actgaaaata aggagaatga agttgaagag gtaaaagagg agggtccaaa 180 
agagatgact ttggatgggt ggtaaatggc t 211 

<210> 59 

<211> 208 

<212> DNA 

<213> Homo sapiens 

<400> 59 

gctcctcttg ccttaccaac tttgcaccca tcatcaacca tgtggccagg tttgcagccc 60 

aggctgcaca tcaggggact gcctcgcaat acttcatgct gttgctgctg actgatggtg 120 

ctgtgacgga tgtggaagcc acacgtgagg ctgtggtgcg tgcctcgaac ctgcccatgt 180 

cagtgatcat tatgggtggt aaatggct 208 

<210> 60 

<211> 171 

<212> DNA 

<213> Homo sapiens 

<400> 60 

agccatttac cacccatact aaattctagt tcaaactcca acttcttcca taaaacatct 60 

aaccactgac accagttggc aatagcttct tccttcttta acctcttaga gtatttatgg 120 

tcaatgccac acatttctgc aactgaataa agttggtaag gcaagaggag c 171 
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<210> 61 

<211> 134 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 37, 70, 80, 86, 88, 97, 117, 123, 131 
<223> n = A,T,C or G 

<400> 61 

cgggtgatgc ctcctcaggc tttggtgtgt ccactcnact cactggcctc ttctccagca 60 

actggtgaan atgtcctcan gaaaancncc acacgcngct cagggtgggg tgggaancat 120 
canaatcatc nggc 134 

<210> 62 

<211> 145 

<212> DNA 

<213> Homo sapiens 

<400> 62 

agagggtaca tatgcaacag tatataaagg aagaagtgca ctgagaggaa cttcatcaag 60 
gccatttaat caataagtga tagagtcaag gctcaaccca ggtgtgacgg attccaggtc 120 
ccaagctcct tactggtacc ctctt 145 

<210> 63 

<211> 297 

<212> DNA 

<213> Homo sapiens 

<400> 63 

tgcactgaga ggaattcaaa gggtttatgc caaagaacaa accagtcctc tgcagcctaa 60 

ctcatttgtt tttgggctgc gaagccatgt agagggcgat caggcagtag atggtccctc 120 

ccacagtcag cgccatggtg gtccggtaaa gcatttggtc aggcaggcct cgtttcaggt 180 

agacgggcac acatcagctt tctggaaaaa cttttgtagc tctggagctt tgtttttccc 240 

agcataatca tacactgtgg aatcggaggt cagtttagtt ggtaaggcaa gaggagc 297 

<210> 64 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 64 

gcactgagag gaacttccaa tactatgttg aataggagtg gtgagagagg gcatccttgt 60 
cttgtgccgg ttttcaaagg gaatgcttcc agcttttgcc cattcagtat aatattaaag 120 
aatgttttac cattttctgt cttgcctgtt tttctgtgtt tttgttggtc tcttcattct 180 
ccatttttag gcctttacat gttaggaata tatttctttt aatgatactt cacctttggt 240 
atcttttgtg agactctact catagtgtga taagcactgg gttggtaagg caagaggagc 



300 



<210> 65 

<211> 203 

<212> DNA 

<213> Homo sapiens 



<400> 65 

gctcctcttg ccttaccaac tcacccagta tgtcagcaat tttatcrgct ttacctacga 60 
aacagcctgt atccaaacac ttaacacact cacctgaaaa gttcaggcaa caatcgcctt 120 
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ctcatgggtc tctctgctcc agttctgaac ctttctcttt tcctagaaca tgcatttarg 180 
tcgatagaag ttcctctcag tgc 203 

<210> 66 

<211> 344 

<212> DNA 

<213> Homo sapiens 

<400> 66 

tacggggacc cctgcattga gaaagcgaga ctcactctga agctgaaatg ctgttgccct 60 

tgcagtgctg gtagcaggag ttctgtgctt tgtgggctaa ggctcctgga tgacccctga 120 

catggagaag gcagagttgt gtgccccttc tcatggcctc gtcaaggcat catggactgc 180 

cacacacaaa atgccgtttt tattaacgac atgaaattga aggagagaac acaattcact 240 

gatgtggctc gtaaccatgg atatggtcac atacagaggt gtgattatgt aaaggttaat 300 

tccacccacc tcatgtggaa actagcctca atgcaggggt ccca 344 

<210> 67 

<211> 157 

<212> DNA 

<213> Homo sapiens 

<400> 67 

gcactgagag gaacttcgta gggaggttga actggctgct gaggaggggg aacaacaggg 60 

taaccagact gatagccatt ggatggataa tatggtggtt gaggagggac actacttata 120 

gcagagggtt gtgtatagcc tgaggaggca tcacccg 157 

<210> 68 

<211> 137 

<212> DNA 

<213> Homo sapiens 

<400> 68 

gcactgagag gaacttctag aaagtgaaag tctagacata aaataaaata aaaatttaaa 60 
actcaggaga gacagcccag cacggtggct cacgcctgta atcccagaac tttgggagcc 120 
tgaggaggca tcacccg 137 

<210> 69 

<211> 137 

<212> DNA 

<213> Homo sapiens 

<400> 69 

cgggtgatgc ctcctcaggc tgtattttga agactatcga ctggacttct tatcaactga 60 

agaatccgtt aaaaatacca gttgtattat ttctacctgt caaaatccat ttcaaatgtt 120 

gaagttcctc tcagtgc 137 

<210> 70 

<211> 220 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<222> 89, 112, 129, 171, 172 

<223> n = A,T,C or G 



<400> 70 

agcatgttga gcccagacac gcaatctgaa tgagtgtgca cctcaagtaa atgtctacac 60 
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gctgcctggt ctgacatggc acaccatcnc gtggagggca casctctgct cngcctacwa 120 
cgagggcant ctcatwgaca ggttccaccc accaaactgc aagaggctca nnaagtactr 180 
ccagggtmya sggacmasgg tgggaytyca ycacwcatct 220 

<210> 71 

<211> 353 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 66, 160, 204, 246, 267, 334, 339, 342 
<223> n - A,T,C or G 

<400> 71 

cgttagggtc tctatccact gctaaaccat acacctgggt aaacagggac catttaacat 60 

tcccanctaa atatgccaag tgacttcaca tgtttatctt aaagatgtcc aaaacgcaac 120 

tgattttctc ccctaaacct gtgatggtgg gatgattaan cctgagtggt ctacagcaag 180 

ttaagtgcaa ggtgctaaat gaangtgacc tgagatacag catctacaag gcagtacctc 240 

tcaacncagg gcaactttgc ttctcanagg gcatttagca gtgtctgaag taatttctgt 300 

attacaactc acggggcggg gggtgaatat ctantggana gnagacccta acg 353 

<210> 72 

<211> 343 

<212> DNA 

<213> Homo sapiens 

<400> 72 

gcactgagag gaacttccaa tacyatkatc agagtgaaca rgcarccyac agaacaggag 60 

aaaatgttyg caatctctcc atctgacaaa aggctaatat ccagawtcta awaggaactt 120 

aaacaaattt atgagaaaag aacaracaac ctcawcaaaa agtgggtgaa ggawatgcts 180 

aaargaagac atytattcag ccagtaaaca yatgaaaaaa aggctcatsa tcactgawca 240 

ttagagaaat gcaaatcaaa accacaatga gataccatct yayrccagtt agaayggtga 300 

tcattaaaar stcaggaaac aacagatgct ggacaaggtg tea 343 

<210> 73 

<211> 321 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 288 

<223> n = A,T,C or G 
<400> 73 

gcactgagag gaacttcaga gagagagaga gagttccacc ctgtacttgg ggagagaaac 60 
agaaggtgag aaagtctttg gttctgaagc agcttctaag atcttttcat ttgettcatt 120 
tcaaagttcc catgctgcca aagtgccatc ctttggggta ctgttttctg agctccagtg 180 
ataactcatt tatacaaggg agatacccag aaaaaaagtg agcaaatctt aaaaaggtgg 240 
cttgagttca gecttaaata ccatcttgaa atgacacaga gaaagaanga tgttgggtgg 300 
gagtggatag agaccctaac g 321 

<210> 74 

<211> 321 

<212> DNA 

<213> Homo sapiens 
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<400> 74 

gcactgagag gaacttcaga gagagagaga gagttccacc ctgtacttgg ggagagaaac 60 

agaaggtgag aaagtctttg gttctgaagc agcttctaag atcttttcat ttgcttcatt 120 

tcaaagttcc catgctgcca aagtgccatc ctttggggta ctgttttctg agctccagtg 180 

ataactcatt tatacaaggg agatacccag aaaaaaagtg agcaaatctt aaaaaggtgg 240 

cttgagttca gycttaaata ccatcttgaa atgamacaga gaaagaagga tgttgggtgg 300 

gagtggatag agaccctaac g 321 

<210> 75 

<211> 317 

<212> DNA 

<213> Homo sapiens 

<400> 75 

gcactgagag gaacttccac atgcactgag aaatgcatgt tcacaaggac tgaagtctgg 60 

aactcagttt ctcagttcca atcctgattc aggtgtttac cagctacaca accttaagca 120 

agtcagataa ccttagcttc ctcatatgca aaatgagaat gaaaagtact catcgctgaa 180 

ttgttttgag gattagaaaa acatctggca tgcagtagaa attcaattag tattcatttt 240 

cattcttcta aattaaacaa ataggatttt tagtggtgga acttcagaca ccagaaatgg 300 

gagtggatag agaccct 317 

<210> 76 

<211> 244 

<212> DNA 

<213> Homo sapiens 

<400> 76 

cgttagggtc tctatccact cccactactg atcaaactct atttatttaa ttatttttat 60 
catactttaa gttctgggat acacgtgcag catgcgcagg tttgttgcat aggtatacac 120 
ttgccatggt ggtttgctgc acccatcagt ccatcatcta cattaggtat ttctcctaat 180 
gctatccctc ccctagcccc ttacaccccc aacaggctct agtgtgtgaa gttcctctca 240 
gtgc 244 

<210> 77 

<211> 254 

<212> DNA 

<213> Homo sapiens 

<400> 77 

cgttagggtc tctatccact gaaatctgaa gcacaggagg aagagaagca gtyctagtga 60 

gatggcaagt tcwtttacca cactctttaa catttygttt agttttaacc tttatttatg 120 

gataataaag gttaatatta ataatgattt attttaaggc attcccraat ttgcataatt 180 

ctccttttgg agataccctt ttatctccag tgcaagtctg gatcaaagtg atasamagaa 240 

gttcctctca gtgc 254 

<210> 78 

<211> 355 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 69, 87, 186, 192, 220, 227, 251, 278, 339, 346, 350 
<223> n « A,T,C or G 

<400> 78 

ttcgatacag gcaaacatga actgcaggag ggtggtgacg atcatgatgt tgccgatggt 60 
ccggatggnc acgaagacgc actggancac gtgcttacgt ccttttgctc tgttgatggc 120 
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cctgagggga cgcaggaccc ttatgaccct cagaatcttc acaacgggag atggcactgg 180 

attgantccc antgacacca gagacacccc aaccaccagn atatcantat attgatgtag 240 

ttcctgtaga nggccccctt gtggaggaaa gctccatnag ttggtcatct tcaacaggat 300 

ctcaacagtt tccgatggct gtgatgggca tagtcatant taaccntgtn tcgaa 355 

<210> 79 

<211> 406 

<212> DNA 

<213> Homo sapiens 

<400> 79 

taagagggta ccagcagaaa ggttagtatc atcagatagc atcttatacg agtaatatgc 60 
ctgctatttg aagtgtaatt gagaaggaaa attttagcgt gctcactgac ctgcctgtag 120 
ccccagtgac agctaggatg tgcattctcc agccatcaag agactgagtc aagttgttcc 180 
ttaagtcaga acagcagact cagctctgac attctgattc gaatgacact gttcaggaat 240 
cggaatcctg tcgattagac tggacagctt gtggcaagtg aatttgcctg taacaagcca 300 
gattttttaa aatttatatt gtaaataatg tgtgtgtgtg tgtgtgtata tatatatata 360 
tgtacagtta tctaagttaa tttaaaagtt gtttggtacc ctctta 406 

<210> 80 

<211> 327 

<212> DNA 

<213> Homo sapiens 

<400> 80 

tttttttttt tttactcggc tcagtctaat cctttttgta gtcactcata ggccagactt 60 

agggctagga tgatgattaa taagagggat gacataacta ttagtggcag gttagttgtt 120 

tgtagggctc atggtagggg taaaaggagg gcaatttcta gatcaaataa taagaaggta 180 

atagctacta agaagaattt tatggagaaa gggacgcggg cgggggatat agggtcgaag 240 

ccgcactcgt aaggggtgga tttttctatg tagccgttga gttgtggtag tcaaaatgta 300 

ataattatta gtagtaagcc taggaga 327 

<210> 81 

<211> 318 

<212> DNA 

<213> Homo sapiens 

<400> 81 

tagtctatgc ggttgattcg gcaatccatt atttgctgga ttttgtcatg tgttttgcca 60 
attgcattca taatttatta tgcatttatg cttgtatctc ctaagtcatg gtatataatc 120 
catgcttttt atgttttgtc tgacataaac tcttatcaga gccctttgca cacagggatt 180 
caataaatat taacacagtc tacatttatt tggtgaatat tgcatatctg ctgtactgaa 240 
agcacattaa gtaacaaagg caagtgagaa gaatgaaaag cactactcac aacagttatc 300 
atgattgcgc atagacta 318 

<210> 82 

<211> 338 

<212> DNA 

<213> Homo sapiens 

<400> 82 

tcttcaacct ctactcccac taatagcttt ttgatgactt ctagcaagcc tcgctaacct 60 

cgccttaccc cccactatta acctactggg agaactctct gtgctagtaa ccacgttctc 120 

ctgatcaaat atcactctcc tacttacagg actcaacata ctagtcacag ccctatactc 180 

cctctacata tttaccacaa cacaatgggg ctcactcacc caccacatta acaacataaa 240 

accctcattc acacgagaaa acaccctcat gttcatacac ctatccccca ttctcctcct 300 

atccctcaac cccgacatca ttaccgggtt ttcctctt 338 
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<210> 83 

<211> 111 

<212> DNA 

<213> Homo sapiens 

<400> 83 

agccatttac cacccatcca caaaaaaaaa aaaaaaaaag aaaaatatca aggaataaaa 60 
atagactttg aacaaaaagg aacatttgct ggcctgagga ggcatcaccc g 111 

<210> 84 

<211> 224 

<212> DNA 

<213> Homo sapiens 

<400> 84 

tcgggtgatg cctcctcagg ccaagaagat aaagcttcag acccctaaca catttccaaa 60 

aaggaagaaa ggagaaaaaa gggcatcatc cccgttccga agggtcaggg aggaggaaat 120 

tgaggtggat tcacgagttg cggacaactc ctttgatgcc aagcgaggtg cagccggaga 180 

ctggggagag cgagccaatc aggttttgaa gttcctctca gtgc 224 

<210> 85 

<211> 348 

<212> DNA 

<213> Homo sapiens 

<400> 85 

gcactgagag gaacttcgtt ggaaacgggt ttttttcatg taaggctaga cagaagaatt 60 

ctcagtaact tccttgtgtt gtgtgtattc aactcacasa gttgaacgat cctttacaca 120 

gagcagactt gtaacactct twttgtggaa tttgcaagtg gagatttcag scgctttgaa 180 

gtsaaaggta gaaaaggaaa tatcttccta taaaaactag acagaatgat tctcagaaac 240 

tcctttgtga tgtgtgcgtt caactcacag agtttaacct ttcwtttcat agaagcagtt 300 
aggaaacact ctgtttgtaa agtctgcaag tggatagaga ccctaacg 348 

<210> 86 

<211> 293 

<212> DNA 

<213> Homo sapiens 

<400> 86 

gcactgagag gaacttcytt gtgwtgtktg yattcaactc acagagttga asswtsmttt 60 

acabagwkca ggcttkcaaa cactcttttt gtmgaatytg caagwggaka tttsrrccrc 120 

tttgwggycw wysktmgaaw mggrwatatc ttcwyatmra amctagacag aaksattctc 180 

akaawstyyy ytgtgawgws tgcrttcaac tcacagagkt kaacmwtyct kytsatrgag 240 

cagttwkgaa actctmtttc tttggattct gcaagtggat agagacccta acg 293 

<210> 87 
<211> 10 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer for amplification from breast cancer 
tumor cDNA 

<400> 87 

ctcctaggct 10 



<210> 88 
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<211> 10 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer for amplification from breast cancer 
tumor cDNA 

<400> 88 

agtagttgcc 10 

<210> 89 
<211> 11 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer for amplification from breast cancer 
tumor cDNA 



<210> 90 
<211> 10 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer for amplification from breast cancer 
tumor cDNA 

<400> 90 

tggtaaaggg 10 

<210> 91 
<211> 10 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer for amplification from breast cancer 
tumor cDNA 



<210> 92 
<211> 10 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer for amplification from breast cancer 
tumor cDNA 



<400> 89 
ttccgttatg c 



11 



<400> 91 
tcggtcatag 



10 



<400> 92 
tacaacgagg 



10 
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<210> 93 
<211> 10 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer for amplification from breast cancer 
tumor cDNA 

<400> 93 

tggattggtc 10 

<210> 94 
<211> 10 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer for amplification from breast cancer 
tumor cDNA 



<210> 95 
<211> 10 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer for amplification from breast cancer 
tumor cDNA 

<400> 95 

ttttggctcc 10 

<210> 96 
<211> 10 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer for amplification from breast cancer 
tumor cDNA 



<210> 97 
<211> 10 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer for amplification from breast cancer 
tumor cDNA 



<400> 94 
ctttctaccc 



10 



<400> 96 
ggaaccaatc 



10 
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<400> 97 
tcgatacagg 



10 



<210> 98 
<211> 10 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer for amplification from breast cancer 
tumor cDNA 

<400> 98 

ggtactaagg 10 

<210> 99 
<211> 10 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer for amplification from breast cancer 
tumor cDNA 



<210> 100 
<211> 10 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer for amplification from breast cancer 
tumor cDNA 

<400> 100 

ctatccatgg 10 

<210> 101 
<211> 10 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer for amplification from breast cancer 
tumor cDNA 

<400> 101 

tctgtccaca 10 

<210> 102 
<211> 10 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer for amplification from breast cancer 



<400> 99 
agtctatgcg 



10 
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tumor cDNA 
<400> 102 

aagagggtac X0 

<210> 103 
<211> 10 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer for amplification from breast cancer 
tumor cDNA 

<400> 103 

cttcaacctc 10 

<210> 104 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer for amplification from breast cancer 
tumor cDNA 

<400> 104 

gctcctcttg ccttaccaac 20 

<210> 105 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer for amplification from breast cancer 
tumor cDNA 

<400> 105 

gtaagtcgag cagtgtgatg 20 

<210> 106 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer for amplification from breast cancer 
tumor cDNA 

<400> 106 

gtaagtcgag cagtctgatg 20 

<210> 107 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> PCR primer for amplification from breast cancer 
tumor cDNA 

<400> 107 

gacttagtgg aaagaatgta 20 

<210> 108 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer for amplification from breast cancer 
tumor cDNA 

<400> 108 

gtaattccgc caaccgtagt 20 

<210> 109 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer for amplification from breast cancer 
tumor cDNA 

<400> 109 

atggttgatc gatagtggaa 20 

<210> 110 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer for amplification from breast cancer 
tumor cDNA 

<400> 110 

acggggaccc ctgcattgag 20 

<210> 111 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer for amplification from breast cancer 
tumor cDNA 

<400> 111 

tattctagac cattcgctac 20 

<210> 112 
<211> 20 
<212> DNA 



WO03Q13431 fhttp:/ /www .qett h e pa t e nt xom xpc?fromCache-l pa rt=maintoolbar=tF aqe 20 1 of 308 



WO 03/013431 PCTYUS02/24917 

33 



<213> Artificial Sequence 
<220> 

<223> PCR primer for amplification from breast cancer 
tumor cDNA 

<400> 112 

acataaccac tttagcgttc 20 

<210> 113 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer for amplification from breast cancer 
tumor cDNA 

<400> 113 

cgggtgatgc ctcctcaggc 20 

<210> 114 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer for amplification from breast cancer 
tumor cDNA 

<400> 114 

agcatgttga gcccagacac 20 

<210> 115 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer for amplification from breast cancer 
tumor cDNA 

<400> 115 

gacaccttgt ccagcatctg 20 

<210> 116 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer for amplification from breast cancer 
tumor cDNA 

<400> 116 

tacgctgcaa cactgtggag 20 



<210> 117 
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<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer for amplification from breast cancer 
tumor cDNA 

<400> 117 

cgttagggtc tctatccact 20 

<210> 118 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer for amplification from breast cancer 
tumor cDNA 

<400> 118 

agactgactc atgtccccta 20 

<210> 119 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer for amplification from breast cancer 
tumor cDNA 

<400> 119 

tcatcgctcg gtgactcaag 20 

<210> 120 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer for -amplification from breast cancer 
tumor cDNA 

<400> 120 

caagattcca taggctgacc 20 

<210> 121 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer for amplification from breast cancer 
tumor cDNA 



<400> 121 

acgtactggt cttgaaggtc 



20 
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<210> 122 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer for amplification from breast cancer 
tumor cDNA 

<400> 122 

gacgcttggc cacttgacac 20 

<210> 123 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer for amplification from breast cancer 
tumor cDNA 

<400> 123 

gtatcgacgt agtggtctcc 20 

<210> 124 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer for amplification from breast cancer 
tumor cDNA 

<400> 124 

tagtgacatt acgacgctgg 20 

<210> 125 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer for amplification from breast cancer 
tumor cDNA 

<400> 125 

cgggtgatgc ctcctcaggc 20 

<210> 126 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer for amplification from breast cancer 
tumor cDNA 
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<400> 126 

atggctattt tcgggggctg aca 23 

<210> 127 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer for amplification from breast cancer 
tumor cDNA 

<400> 127 

ccggtatctc ctcgtgggta tt 22 

<210> 128 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer for amplification from breast cancer 
tumor cDNA 

<400> 128 

ctgcctgagc cacaaatg 18 

<210> 129 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer for amplification from breast cancer 
tumor cDNA 

<400> 129 

ccggaggagg aagctagagg aata 24 

<210> 130 
<211> 14 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer for amplification from breast cancer 
tumor cDNA 

<400> 130 

tttttttttt ttag 14 

<210> 131 
<211> 18 
<212> PRT 

<213> Artificial Sequence 



<220> 

<223> Predicited Th Motifs (B-cell epitopes) 
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<400> 131 

Ser Ser Gly Gly Arg Thr Phe Asp Asp Phe His Arg Tyr Leu Leu Val 

15 10 15 

Gly lie 



<210> 132 
<211> 22 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Predicited Th Motifs (B-cell epitopes) 

<221> VARIANT 
<222> 13 

<223> Xaa = Any Amino Acid 
<400> 132 

Gin Gly Ala Ala Gin Lys Pro lie Asn Leu Ser Lys Xaa lie Glu Val 

15 10 15 

Val Gin Gly His Asp Glu 
20 



<210> 133 
<211> 23 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Predicited Th Motifs (B-cell epitopes) 
<400> 133 

Ser Pro Gly Val Phe Leu Glu His Leu Gin Glu Ala Tyr Arg lie Tyr 

15 10 15 

Thr Pro Phe Asp Leu Ser Ala 
20 



<210> 134 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Predicited HLA A2.1 Motifs (T-cell epitopes) 
<400> 134 

Tyr Leu Leu Val Gly He Gin Gly Ala 
1 5 



<210> 135 
<211> 9 
<212> PRT 
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<213> Artificial Sequence 
<220> 

<223> Predicited HLA A2.1 Motifs (T-cell epitopes) 
<400> 135 

Gly Ala Ala Gin Lys Pro lie Asn Leu 
1 5 



<210> 136 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Predicited HLA A2,l Motifs (T-cell epitopes) 

<221> VARIANT 
<222> 5 

<223> Xaa = Any Amino Acid 
<400> 136 

Asn Leu Ser Lys Xaa lie Glu Val Val 
1 5 



<210> 137 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Predicited HLA A2.1 Motifs (T-cell epitopes) 
<400> 137 

Glu Val Val Gin Gly His Asp Glu Ser 
1 5 



<210> 138 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Predicited HLA A2.1 Motifs (T-cell epitopes) 
<400> 138 

His Leu Gin Glu Ala Tyr Arg lie Tyr 
1 5 



<210> 139 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
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<220> 

<223> Predicated HLA A2.1 Motifs (T-cell epitopes) 
<400> 139 

Asn Leu Ala Phe Val Ala Gin Ala Ala 
1 5 



<210> 140 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Predicited HLA A2.1 Motifs (T-cell epitopes) 
<400> 140 

Phe Val Ala Gin Ala Ala Pro Asp Ser 
1 5 



<210> 141 
<211> 9388 
<212> DNA 

<213> Homo sapiens 
<400> 141 

gctcgcggcc gcgagctcaa ttaaccctca ctaaagggag tcgactcgat cagactgtta 60 
ctgtgtctat gtagaaagaa gtagacataa gagattccat tttgttctgt actaagaaaa 120 
attcttctgc cttgagatgc tgttaatctg taaccctagc cccaaccctg tgctcacaga 180 
gacatgtgct gtgttgactc aaggttcaat ggatttaggg ctatgctttg ttaaaaaagt 240 
gcttgaagat aatatgcttg ttaaaagtca tcaccattct ctaatctcaa gtacccaggg 300 
acacaataca ctgcggaagg ccgcagggac ctctgtctag gaaagccagg tattgtccaa 360 
gatttctccc catgtgatag cctgagatat ggcctcatgg gaagggtaag acctgactgt 420 
cccccagccc gacatccccc agcccgacat cccccagccc gacacccgaa aagggtctgt 480 
gctgaggagg attagtaaaa gaggaaggcc tctttgcagt tgaggtaaga ggaaggcatc 540 
tgtctcctgc tcgtccctgg gcaatagaat gtcttggtgt aaaacccgat tgtatgttct 600 
acttactgag ataggagaaa acatccttag ggctggaggt gagacacgct ggcggcaata 660 
ctgctcttta atgcaccgag atgtttgtat aagtgcacat caaggcacag cacctttcct 720 
taaacttatt tatgacacag agacctttgt tcacgttttc ctgctgaccc tctccccact 780 
attaccctat tggcctgcca catccccctc tccgagatgg tagagataat gatcaataaa 840 
tactgaggga actcagagac cagtgtccct gtaggtcctc cgtgtgctga gcgccggtcc 900 
cttgggctca cttttctttc tctatacttt gtctctgtgt ctctttcttt tctcagtctc 960 
tcgttccacc tgacgagaaa tacccacagg tgtggagggg caggccaccc cttcaataat 1020 
ttactagcct gttcgctgac aacaagactg gtggtgcaga aggttgggtc ttggtgttca 1080 
ccgggtggca ggcatgggcc aggtgggagg gtctccagcg cctggtgcaa atctccaaga 1140 
aagtgcagga aacagcacca agggtgattg taaattttga tttggcgcgg caggtagcca 1200 
ttccagcgca aaaatgcgca ggaaagcttt tgctgtgctt gtaggcaggt aggccccaag 1260 
cacttcttat tggctaatgt ggagggaacc tgcacatcca ttggctgaaa tctccgtcta 1320 
tttgaggctg actgagcgcg ttcctttctt ctgtgttgcc tggaaacgga ctgtctgcct 1380 
agtaacatct gatcacgttt cccattggcc gccgtttccg gaagcccgcc ctcccatttc 1440 
cggaagcctg gcgcaaggtt ggtctgcagg tggcctccag gtgcaaagtg ggaagtgtga 1500 
gtcctcagtc ttgggctatt cggccacgtg cctgccggac atgggacgct ggagggtcag 1560 
cagcgtggag tcctggcctt ttgcgtccac gggtgggaaa ttggccattg ccacggcggg 1620 
aactgggact caggctgccc cccggccgtt tctcatccgt ccaccggact cgtgggcgct 1680 
cgcactggcg ctgatgtagt ttcctgacct ctgacccgta ttgtctccag attaaaggta 1740 
aaaacggggc tttttcagcc cactcgggta aaacgccttt tgatttctag gcaggtgttt 1800 
tgttgcacgc ctgggaggga gtgacccgca ggttgaggtt tattaaaata cattcctggt 1860 
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ttatgttatg tttataataa agcaccccaa cctttacaaa atctcacttt ttgccagttg 1920 
tattatttag tggactgtct ctgataagga cagccagtta aaatggaatt ttgttgttgc 1980 
taattaaacc aatttttagt tttggtgttt gtcctaatag caacaacttc tcaggcttta 2040 
taaaaccata tttcttgggg gaaatttctg tgtaaggcac agcgagttag tttggaattg 2100 
ttttaaagga agtaagttcc tggttttgat atcttagtag tgtaatgccc aacctggttt 2160 
ttactaaccc tgtttttaga ctctcccttt ccttaaatca cctagccttg tttccacctg 2220 
aattgactct cccttagcta agagcgccag atggactcca tcttggctct ttcactggca 2280 
gccccttcct caaggactta acttgtgcaa gctgactccc agcacatcca agaatgcaat 2340 
taactgttaa gatactgtgg caagctatat ccgcagttcc gaggaattca tccgattgat 2400 
tatgcccaaa agccccgcgt ctatcacctt gtaataatct taaagcccct gcacctggaa 2460 
ctattaactt tcctgtaacc atttatcctt ttaacttttt tgcttacttt atttctgtaa 2520 
aattgtttta actagacctc ccctcccctt tctaaaccaa agtataaaag aagatctagc 2580 
cccttcttca gagcggagag aattttgagc attagccatc tcttggcggc cagctaaata 2640 
aatggacttt taatttgtct caaagtgtgg cgttttctct aactcgctca ggtacgacat 2700 
ttggaggccc cagcgagaaa cgtcaccggg agaaacgtca ccgggcgaga gccgggcccg 2760 
ctgtgtgctc ccccggaagg acagccagct tgtagggggg agtgccacct gaaaaaaaaa 2820 
tttccaggtc cccaaagggt gaccgtcttc cggaggacag cggatcgact accatgcggg 2880 
tgcccaccaa aattccacct ctgagtcctc aactgctgac cccggggtca ggtaggtcag 2940 
atttgacttt ggttctggca gagggaagcg accctgatga gggtgtccct cttttgactc 3000 
tgcccatttc tctaggatgc tagagggtag agccctggtt ttctgttaga cgcctctgtg 3060 
tctctgtctg ggagggaagt ggccctgaca ggggccatcc cttgagtcag tccacatccc 3120 
aggatgctgg gggactgagt cctggtttct ggcagactgg tctctctctc tctctttttc 3180 
tatctctaat ctttccttgt tcaggtttct tggagaatct ctgggaaaga aaaaagaaaa 3240 
actgttataa actctgtgtg aatggtgaat gaatggggga ggacaagggc ttgcgcttgt 3300 
cctccagttt gtagctccac ggcgaaagct acggagttca agtgggccct cacctgcggt 3360 
tccgtggcga cctcataagg cttaaggcag catccggcat agctcgatcc gagccggggg 3420 
tttataccgg cctgtcaatg ctaagaggag cccaagtccc ctaaggggga gcggccaggc 3480 
gggcatctga ctgatcccat cacgggaccc cctccccttg tttgtctaaa aaaaaaaaaa 3540 
gaagaaactg tcataactgt ttacatgccc tagggtcaac tgtttgtttt atgtttattg 3600 
ttctgttcgg tgtctattgt cttgtttagt ggttgtcaag gttttgcatg tcaggacgtc 3660 
gatattgccc aagacgtctg ggtaagaact tctgcaaggt ccttagtgct gattttttgt 3720 
cacaggaggt taaatttctc atcaatcatt taggctggcc accacagtcc tgtcttttct 3780 
gccagaagca agtcaggtgt tgttacggga atgagtgtaa aaaaacattc gcctgattgg 3840 
gatttctggc accatgatgg ttgtatttag attgtcatac cccacatcca ggttgattgg 3900 
acctcctcta aactaaactg gtggtgggtt caaaacagcc accctgcaga tttccttgct 3960 
cacctctttg gtcattctgt aacttttcct gtgcccttaa atagcacact gtgtagggaa 4020 
acctaccctc gtactgcttt acttcgttta gattcttact ctgttcctct gtggctactc 4080 
tcccatctta aaaacgatcc aagtggtcct tttcctcctc cctgccccct accccacaca 4140 
tctcgttttc cagtgcgaca gcaagttcag cgtctccagg acttggctct gctctcactc 4200 
cttgaaccct taaaagaaaa agctgggttt gagctatttg cctttgagtc atggagacac 4260 
aaaaggtatt tagggtacag atctagaaga agagagagaa cacctagatc caactgaccc 4320 
aggagatctc gggctggcct ctagtcctcc tccctcaatc ttaaagctac agtgatgtgg 4380 
caagtggtat ttagctgttg tggtttttct gctctttctg gtcatgttga ttctgttctt 4440 
tcgatactcc agccccccag ggagtgagtt tctctgtctg tgctgggttt gatatctatg 4500 
ttcaaatctt attaaattgc cttcaaaaaa aaaaaaaaaa gggaaacact tcctcccagc 4560 
cttgtaaggg ttggagccct ctccagtata tgctgcagaa tttttctctc ggtttctcag 4620 
aggattatgg agtccgcctt aaaaaaggca agctctggac actctgcaaa gtagaatggc 4680 
caaagtttgg agttgagtgg ccccttgaag ggtcactgaa cctcacaatt gttcaagctg 4740 
tgtggcgggt tgttactgaa actcccggcc tccctgatca gtttccctac attgatcaat 4800 
ggctgagttt ggtcaggagc accccttcca tggctccact catgcaccat tcataatttt 4860 
acctccaagg tcctcctgag ccagaccgtg ttttcgcctc gaccctcagc cggttcagct 4920 
cgccctgtac tgcctctctc tgaagaagag gagagtctcc ctcacccagt cccaccgcct 4980 
taaaaccagc ctactccctt agggtcatcc catgtctcct cggctatgtc ccctgtaggc 5040 
tcatcaccca ttgcctcttg gttgcaaccg tggtgggagg aagtagcccc tctactacca 5100 
ctgagagagg cacaagtccc tctgggtgat gagtgctcca cccccttcct ggtttatgtc 5160 
ccttctttct acttctgact tgtataattg gaaaacccat aatcctccct tctctgaaaa 5220 
gccccaggct ttgacctcac tgatggagtc tgtactctgg acacattggc ccacctggga 5280 
tgactgtcaa cagctccttt tgaccctttt cacctctgaa gagagggaaa gtatccaaag 5340 
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agaggccaaa aagtacaacc tcacatcaac caataggccg gaggaggaag ctagaggaat 5400 
agtgattaga gacccaattg ggacctaatt gggacccaaa tttctcaagt ggagggagaa 5460 
cttttgacga tttccaccgg tatctcctcg tgggtattca gggagctgct cagaaaccta 5520 
taaacttgtc taaggcgact gaagtcgtcc aggggcatga tgagtcacca ggagtgtttt 5580 
tagagcacct ccaggaggct tatcggattt acaccccttt tgacctggca gcccccgaaa 5640 
atagccatgc tcttaatttg gcatttgtgg ctcaggcagc cccagatagt aaaaggaaac 5700 
tccaaaaact agagggattt tgctggaatg aataccagtc agcttttaga gatagcctaa 5760 
aaggtttttg acagtcaaga ggttgaaaaa caaaaacaag cagctcaggc agctgaaaaa 5820 
agccactgat aaagcatcct ggagtatcag agtttactgt tagatcagcc tcatttgact 5880 
tcccctccca catggtgttt aaatccagct acactacttc ctgactcaaa ctccactatt 5940 
cctgttcatg actgtcagga actgttggaa actactgaaa ctggccgacc tgatcttcaa 6000 
aatgtgcccc taggaaaggt ggatgccacc gtgttcacag acagtagcag cttcctcgag 6060 
aagggactac gaaaggccgg tgcagctgtt accatggaga cagatgtgtt gtgggctcag 6120 
gctttaccag caaacacctc agcacaaaag gctgaattga tcgccctcac tcaggctctc 6180 
cgatggggta aggatattaa cgttaacact gacagcaggt acgcctttgc tactgtgcat 6240 
gtacgtggag ccatctacca ggagcgtggg ctactcacct cagcaggtgg ctgtaatcca 6300 
ctgtaaagga catcaaaagg aaaacacggc tgttgcccgt ggtaaccaga aagctgattc 6360 
agcagctcaa gatgcagtgt gactttcagt cacgcctcta aacttgctgc ccacagtctc 6420 
ctttccacag ccagatctgc ctgacaatcc cgcatactca acagaagaag aaaactggcc 6480 
tcagaactca gagccaataa aaatcaggaa ggttggtgga ttcttcctga ctctagaatc 6540 
ttcatacccc gaactcttgg gaaaacttta atcagtcacc tacagtctac cacccattta 6600 
ggaggagcaa agctacctca gctcctccgg agccgtttta agatccccca tcttcaaagc 6660 
ctaac agate aagcagctct ccggtgcaca acctgcgccc aggtaaatgc caaaaaaggt 6720 
cctaaaccca gcccaggcca ccgtctccaa gaaaactcac caggagaaaa gtgggaaatt 6780 
gactttacag aagtaaaacc acaoeggget gggtacaaat accttctagt actggtagac 6840 
accttctctg gatggactga ageatttget accaaaaacg aaactgtcaa tatggtagtt 6900 
aagtttttac tcaatgaaat catccctcga cgtgggctgc ctgttgccat agggtctgat 6960 
aatggaccgg ccttcgcctt gtctatagtt tagtcagtca gtaaggcgtt aaacattcaa 7020 
tggaagctcc attgtgccta tcgaccccag agctctgggc aagtagaacg catgaactgc 7080 
accctaaaaa acactcttac aaaattaatc ttagaaaccg gtgtaaattg tgtaagtctc 7140 
cttcctttag ccctacttag agtaaggtgc accccttact gggctgggtt cttacctttt 7200 
gaaatcatgt atgggagggc gctgcctatc ttgectaage taagagatgc ccaattggca 7260 
aaaatatcac aaactaattt attacagtac ctacagtctc cccaacaggt acaagatatc 7320 
atcctgccac ttgttcgagg aacccatccc aatccaattc ctgaacagac agggccctgc 7380 
cattcattcc cgccaggtga cctgttgttt gttaaaaagt tccagagaga aggactccct 7440 
cctgcttgga agagacctca caccgtcatc aegatgecaa eggctctgaa ggtggatggc 7500 
attcctgegt ggattcatca ctcccgcatc aaaaaggeca acggagccca actagaaaca 7560 
tgggtcccca gggctgggtc aggcccctta aaactgcacc taagttgggt gaagecatta 7620 
gattaattct ttttcttaat tttgtaaaac aatgeatage ttctgtcaaa cttatgtatc 7680 
ttaagactca atataacccc cttgttataa ctgaggaatc aatgatttga ttccccaaaa 7740 
acacaagtgg ggaatgtagt gtccaacctg gtttttacta accctgtttt tagactctcc 7800 
ctttccttta atcactcagc cttgtttcca cctgaattga ctctccctta getaagageg 7860 
ccagatggac tccatcttgg ctctttcact ggcagccgct tcctcaagga cttaacttgt 7920 
gcaagctgac tcccagcaca tccaagaatg caattaactg ataagatact gtggcaagct 7980 
atatcegcag ttcccaggaa ttcgtccaat tgattacacc caaaagcccc gegtctatea 8040 
ccttgtaata atcttaaagc ccctgcacct ggaactatta acgttcctgt aaccatttat 8100 
ccttttaact tttttgecta ctttatttct gtaaaattgt tttaactaga ccccccctct 8160 
cctttctaaa ccaaagtata aaagcaaatc tagccccttc ttcaggcega gagaatttcg 8220 
agegttagee gtctcttggc caccagctaa ataaaeggat tcttcatgtg tctcaaagtg 8280 
tggcgttttc tctaactcgc tcaggtacga ccgtggtagt attttcccca aegtcttatt 8340 
tttagggcac gtatgtagag taacttttat gaaagaaacc agttaaggag gttttgggat 8400 
ttcctttatc aactgtaata ctggttttga ttatttattt atttatttat tttttttgag 8460 
aaggagtttc actcttgttg cccaggctgg agtgcaatgg tgcgatcttg gctcactgca 8520 
acttccgcct cccaggttca agegattetc ctgcctcagc ctcgagagta gctgggatta 8580 
taggcatgeg ccaccacacc cagctaattt tgtattttta gtaaagatgg ggtttcttca 8640 
tgttggtcaa gctggtctgg aactccccgc ctegggtgat ctgcccgcct cggcctccga 8700 
aagtgctggg attacaggtg tgatccacca cacccagccg atttatatgt atataaatca 8760 
cattcctcta accaaaatgt agtgtttcct tccatcttga atataggctg tagaccccgt 8820 
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gggtatggga cattgttaac agtgagacca cagcagtttt tatgtcatct gacagcatct 8880 
ccaaatagcc ttcatggttg tcactgcttc ccaagacaat tccaaataac acttcccagt 8940 
gatgacttgc tacttgctat tgttacttaa tgtgttaagg tggctgttac agacactatt 9000 
agtatgtcag gaattacacc aaaatttagt ggctcaaaca atcattttat tatgtatgtg 9060 
gattctcatg gtcaggtcag gatttcagac agggcacaag ggtagcccac ttgtctctgt 9120 
ctatgatgtc tggcctcagc acaggagact caacagctgg ggtctgggac catttggagg 9180 
cttgttccct cacatctgat acctggcttg ggatgttgga agagggggtg agctgagact 9240 
gagtgcctat atgtagtgtt tccatatggc cttgacttcc ttacagcctg gcagcctcag 9300 
ggtagtcaga attcttagga ggcacagggc tccagggcag atgctgaggg gtcttttatg 9360 
aggtagcaca gcaaatccac ccaggatc 9388 



<210> 142 
<211> 419 
<212> DNA 
<213> Homo sapiens 

<400> 142 

tgtaagtcga gcagtgtgat ggaaggaatg gtctttggag agagcatatc catctcctcc 60 

tcactgcctc ctaatgtcat gaggtacact gagcagaatt aaacagggta gtcttaacca 120 

cactattttt agctaccttg tcaagctaat ggttaaagaa cacttttggt ttacacttgt 180 

tgggtcatag aagttgcttt ccgccatcac gcaataagtt tgtgtgtaat cagaaggagt 240 

taccttatgg tttcagtgtc attctttagt taacttggga gctgtgtaat ttaggctttg 300 

cgtattattt cacttctgtt ctccacttat gaagtgattg tgtgttcgcg tgtgtgtgcg 360 

tgcgcatgtg cttccggcag ttaacataag caaataccca acatcacact gctcgactt 419 

<210> 143 

<211> 402 

<212> DNA 

<213> Homo sapiens 

<400> 143 

tgtaagtcga gcagtgtgat gtccactgca gtgtgttgct gggaacagtt aatgagcaaa 60 
ttgtatacaa tggctagtac attgaccggg atttgttgaa gctggtgagt gttatgactt 120 
agcctgttag actagtctat gcacatggct ctggtcaact accgctctct catttctcca 180 
gataaatccc ccatgcttta tattctcttc caaacatact atcctcatca ccacatagtt 240 
cctttgttaa tgctttgttc tagactttcc cttttctgtt ttcttattca aacctatatc 300 
tctttgcata gattgtaaat tcaaatgccc tcagggtgca ggcagttcat gtaagggagg 360 
gaggctagcc agtgagatct gcatcacact gctcgactta ca 402 

<210> 144 
<211> 224 
<212> DNA 
<213> Homo sapiens 

<400> 144 

tcgggtgatg cctcctcagg ccaagaagat aaagcttcag acccctaaca catttccaaa 60 
aaggaagaaa ggagaaaaaa gggcatcatc cccgttccga agggtcaggg aggaggaaat 120 
tgaggtggat tcacgagttg cggacaactc ctttgatgcc aagcgaggtg cagccggaga 180 
ctggggagag cgagccaatc aggttttgaa gttcctctca gtgc 224 

<210> 145 
<211> 111 
<212> DNA 
<213> Homo sapiens 



<400> 145 
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agccatttac cacccatcca caaaaaaaaa aaaaaaaaag aaaaatatca aggaataaaa 60 
atagactttg aacaaaaagg aacatttgct ggcctgagga ggcatcaccc g 111 

<210> 146 
<211> 585 
<212> DNA 
<213> Homo sapiens 

<400> 146 

tagcatgttg agcccagaca cttgtagaga gaggaggaca gttagaagaa gaagaaaagt 60 

ttttaaatgc tgaaagttac tataagaaag ctttggcttt ggatgagact tttaaagatg 120 

cagaggatgc tttgcagaaa cttcataaat atatgcaggt gattccttat ttcctcctag 180 

aaatttagtg atatttgaaa taatgcccaa acttaatttt ctcctgagga aaactattct 240 

acattactta agtaaggcat tatgaaaagt ttctttttag gtatagtttt tcctaattgg 300 

gtttgacatt gcttcatagt gcctctgttt ttgtccataa tcgaaagtaa agatagctgt 360 

gagaaaacta ttacctaaat ttggtatgtt gttttgagaa atgtccttat agggagctca 420 

cctggtggtt tttaaattat tgttgctact ataattgagc taattataaa aacctttttg 480 

agacatattt taaattgtct tttcctgtaa tactgatgat gatgttttct catgcatttt 540 

cttctgaatt gggaccattg ctgctgtgtc tgggctcaca tgcta 585 

<210> 147 
<211> 579 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 383, 453, 465, 501 
<223> n = A,T,C or G 

<400> 147 

tagcatgttg agcccagaca ctgggcagcg ggggtggcca cggcagctcc tgccgagccc 60 

aagcgtgttt gtctgtgaag gaccctgacg tcacctgcca ggctagggag gggtcaatgt 120 

ggagtgaatg ttcaccgact ttcgcaggag tgtgcagaag ccaggtgcaa cttggtttgc 180 

ttgtgttcat cacccctcaa gatatgcaca ctgctttcca aataaagcat caactgtcat 240 

ctccagatgg ggaagacttt ttctccaacc agcaggcagg tccccatcca ctcagacacc 300 

agcacgtcca ccttctcggg cagcaccacg tcctccacct tctgctggta cacggtgatg 360 

atgtcagcaa agccgttctg cangaccagc tgccccgtgt gctgtgccat ctcactggcc 420 

tccaccgcgt acaccgctct aggccgcgca tantgtgcac agaanaaatg atgatccagt 480 

cccacagccc acgtccaaga ngactttatc cgtcagggat tctttattct gcaggatgac 540 

ctgtggtatt aattgttcgt gtctgggctc aacatgcta 579 

<210> 148 

<211> 249 

<212> DNA 

<213> Homo sapiens 

<400> 148 

tgacaccttg tccagcatct gcaagccagg aagagagtcc tcaccaagat ccccaccccg 60 
ttggcaccag gatcttggac ttccaatctc cagaactgtg agaaataagt atttgtcgct 120 
aaataaatct ttgtggtttc agatatttag ctatagcaga tcaggctgac taagagaaac 180 
cccataagag ttacatactc attaatctcc gtctctatcc ccaggtctca gatgctggac 240 
aaggtgtca 249 

<210> 149 

<211> 255 

<212> DNA 

<213> Homo sapiens 
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<400> 149 

tgacaccttg tccagcatct gctattttgt gactttttaa taatagccat tctgactggt 60 
gtgagatggt aactcattgt gggtttggtc tgcatttctc taatgatcag tgatattaag 120 
ctttttttaa atatgcttgt tgaccacatg tatatcatct tttgagaagt gtctgttcat 180 
atcctttgcc cactttttaa tttttttatc ttgtaaattt gtttaatttc cttacagatg 240 
ctggacaagg tgtca 255 

<210> 150 
<211> 318 
<212> DNA 

<213> Homo sapiens 
<400> 150 

ttacgctgca acactgtgga ggccaagctg ggatcacttc ttcattctaa ctggagagga 60 
gggaagttca agtccagcag agggtgggtg ggtagacagt ggcactcaga aatgtcagct 120 
ggacccctgt ccccgcatag gcaggacagc aaggctgtgg ctctccaggg ccagctgaag 180 
aacaggacac tgtctccgct gccacaaagc gtcagagact cccatctttg aagcacggcc 240 
ttcttggtct tcctgcactt ccctgttctg ttagagacct ggttatagac aaggcttctc 300 
cacagtgttg cagcgtaa 318 

<210> 151 
<211> 323 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<222> 2, 7, 10, 13, 14, 23, 26, 32, 44, 54, 56, 67, 74, 75, 81, 
87, 104, 105, 109, 111, 120, 123, 124, 136, 137, 138, 151, 
155, 162, 168, 171, 176, 184, 186, 196, 215, 231, 239, 252, 
265, 288, 318 
<223> n = A,T,C or G 

<400> 151 

tnacgcngcn acnntgtaga ganggnaagg cnttccccac attncccctt catnanagaa 60 

ttattcnacc aagnntgacc natgccnttt atgacttaca tgcnnactnc ntaatctgtn 120 

tcnngcctta aaagcnnntc cactacatgc ntcancactg tntgtgtnac ntcatnaact 180 

gtcngnaata ggggcncata actacagaaa tgcanttcat actgcttcca ntgccateng 240 

cgtgtggcct tncctactct tcttntattc caagtagcat ctctggantg cttccccact 300 

ctccacattg ttgcagcnat aat 323 

<210> 152 
<211> 311 
<212> DNA 

<213> Homo sapiens 
<400> 152 

tcaagattcc ataggctgac cagtccaagg agagttgaaa tcatgaagga gagtctatct 60 
ggagagagct gtagttttga gggttgcaaa gacttaggat ggagttggtg ggtgtggtta 120 
gtctctaagg ttgattttgt tcataaattt catgccctga atgccttgct tgcctcaccc 180 
tggtccaagc cttagtgaac acctaaaagt ctctgtcttc ttgctctcca aacttctcct 240 
gaggatttcc tcagattgtc tacattcaga tcgaagccag ttggcaaaca agatgcagtc 300 
cagagggtca g 311 

<210> 153 
<211> 332 
<212> DNA 
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<213> Homo sapiens 
<400> 153 

caagattcca taggctgacc aggaggctat tcaagatctc tggcagttga ggaagtctct 60 

ttaagaaaat agtttaaaca atttgttaaa atttttctgt cttacttcat ttctgtagca 120 

gttgatatct ggctgtcctt tttataatgc agagtgggaa ctttccctac catgtttgat 180 

aaatgttgtc caggctccat tgccaataat gtgttgtcca aaatgcctgt ttagttttta 240 

aagacggaac tccacccttt gcttggtctt aagtatgtat ggaatgttat gataggacat 300 

agtagtagcg gtggtcagcc tatggaatct tg 332 

<210> 154 
<211> 345 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 154, 224, 297, 330 
<223> n = A,T,C or G 

<400> 154 

tcaagattcc ataggctgac ctggacagag atctcctggg tctggcccag gacagcaggc 60 

tcaagctcag tggagaaggt ttccatgacc ctcagattcc cccaaacctt ggattgggtg 120 

acattgcatc tcctcagaga gggaggagat gtangtctgg gcttccacag ggacctggta 180 

ttttaggatc agggtaccgc tggcctgagg cttggatcat tcanagcctg ggggtggaat 240 

ggctggcagc ctgtggcccc attgaaatag gctctggggc actccctctg ttcctanttg 300 
aacttgggta aggaacagga atgtggtcan cctatggaat cttga 345 

<210> 155 
<211> 295 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<222> 46, 199, 252, 266 
<223> n = A,T,C or G 

<400> 155 

gacgcttggc cacttgacac attaaacagt tttgcataat cactancatg tatttctagt 60 
ttgctgtctg ctgtgatgcc ctgccctgat tctctggcgt taatgatggc aagcataatc 120 
aaacgctgtt ctgttaattc caagttataa ctggcattga ttaaagcatt atctttcaca 180 
actaaactgt tcttcatana acagcccata ttattatcaa attaagagac aatgtattcc 240 
aatatccttt anggccaata tatttnatgt cccttaatta agagctactg tccgt 295 

<210> 156 
<211> 406 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<222> 172, 178, 332, 338, 342, 381, 400, 402 
<223> n = A,T,C or G 

<400> 156 

gacgcttggc cacttgacac tgcagtggga aaaccagcat gagccgctgc ccccaaggaa 60 
cctcgaagcc caggcagagg accagccatc ccagcctgca ggtaaagtgt gtcacctgtc 120 
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aggtgggctt ggggtgagtg ggtgggggaa gtgtgtgtgc aaagggggtg tnaatgtnta 180 
tgcgtgtgag catgagtgat ggctagtgtg actgcatgtc agggagtgtg aacaagcgtg 240 
cgggggtgtg tgtgcaagtg cgtatgcata tgagaatatg tgtctgtgga tgagtgcatt 300 
tgaaagtctg tgtgtgtgcg tgtggtcatg anggtaantt antgactgcg caggatgtgt 360 
gagtgtgcat ggaacactca ntgtgtgtgt caagtggccn ancgtc 406 

<210> 157 
<211> 208 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 115, 119, 182, 187 
<223> n « A,T,C or G 

<400> 157 

tgacgcttgg ccacttgaca cactaaaggg tgttactcat cactttcttc tctcctcggt 60 

ggcatgtgag tgcatctatt cacttggcac tcatttgttt ggcagtgact gtaanccana 120 

tctgatgcat acaccagctt gtaaattgaa taaatgtctc taatactatg tgctcacaat 180 
anggtanggg tgaggagaag gggagaga 208 

<210> 158 
<211> 547 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 235 

<223> n = A,T,C or G 
<400> 158 

cttcaacctc cttcaacctc cttcaacctc ctggattcaa acaatcatcc cacctcagac 60 
tccttagtag ctgagactac agactcacgc cactacatct ggctaaattt ttgtagagat 120 
agggtttcat catgttgccc tggctggtct caaactcctg acctcaagca atgtgcccac 180 
ctcagcctcc caaagtgctg ggattacagg cataagccac catgcccagt ccatntttaa 240 
tctttcctac cacattctta ccacactttc ttttatgttt agatacataa atgcttacca 300 
ttatgataca attgcccaca gtattaagac agtaacatgc tgcacaggtt tgtagcctag 360 
gaacagtagg caataccaca tagcttaggt gtgtggtaga ctataccatc taggtttgtg 420 
taagttacac tttatgctgt ttacacaatg acaaaaccat ctaatgatgc atttctcaga 480 
atgtatcctt gtcagtaagc tatgatgtac agggaacact gcccaaggac acagatattg 540 
tacctgt 547 

<210> 159 
<211> 203 
<212> DNA 
<213> Homo sapiens 

<400> 159 

gctcctcttg ccttaccaac tcacccagta tgtcagcaat tttatcrgct ttacctacga 60 
aacagcctgt atccaaacac ttaacacact cacctgaaaa gttcaggcaa caatcgcctt 120 
ctcatgggtc tctctgctcc agttctgaac ctttctcttt tcctagaaca tgcatttarg 180 
tcgatagaag ttcctctcag tgc 203 



<210> 160 
<211> 402 
<212> DNA 
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<213> Homo sapiens 
<400> 160 

tgtaagtcga gcagtgtgat gggtggaaca gggttgtaag cagtaattgc aaactgtatt 60 

taaacaataa taataatatt tagcatttat agagcacttt atatcttcaa agtacttgca 120 

aacattayct aattaaatac cctctctgat tataatctgg atacaaatgc acttaaactc 180 

aggacagggt catgagaraa gtatgcattt gaaagttggt gctagctatg ctttaaaaac 240 

ctatacaatg atgggraagt tagagttcag attctgttgg actgtttttg tgcatttcag 300 

ttcagcctga tggcagaatt agatcatatc tgcactcgat gactytgctt gataacttat 360 

cactgaaatc tgagtgttga tcatcacact gctcgactta ca 402 

<210> 161 
<211> 193 
<212> DNA 
<213> Homo sapiens 

<400> 161 

agcatgttga gcccagacac tgaccaggag aaaaaccaac caatagaaac acgcccagac 60 

actgaccagg agaaaaacca accaataaaa acaggcccgg acataagaca aataataaaa 120 

ttagcggaca aggacatgaa aacagctatt gtaagagcgg atatagtggt gtgtgtctgg 180 
gctcaacatg eta 193 

<210> 162 
<211> 147 
<212> DNA 
<213> Homo sapiens 

<400> 162 

tgttgagccc agacactgac caggagaaaa accaaccaat aaaaacaggc ccggacataa 60 

gacaaataat aaaattagcg gacaaggaca tgaaaacagc tattgtaaga geggatatag 120 

tggtgtgtgt ctgggctcaa catgeta 147 

<210> 163 
<211> 294 
<212> DNA 
<213> Homo sapiens 

<400> 163 

tagcatgttg agcccagaca caaatctttc cttaagcaat aaatcatttc tgcatatgtt 60 
tttaaaacca cagctaagcc atgattattc aaaaggacta ttgtattggg tattttgatt 120 
tgggttctta tctccctcac attatcttca tttctatcat tgacctctta tcccagagac 180 
tctcaaactt ttatgttata caaatcacat tctgtctcaa aaaatatctc acccacttct 240 
cttctgtttc tgcgtgtgta tgtgtgtgtg tgtgtgtctg ggctcaacat gcta 294 

<210> 164 
<211> 412 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 292 

<223> n = A,T,C or G 
<400> 164 

egggattgge tttgagctgc agatgetgee tgtgaccgca cccggcgtgg aacagaaagc 60 
cacctggctg caagtgcgcc agagccgccc tgactacgtg ctgctgtggg gctggggcgt 120 
gatgaactcc accgccctga aggaagecca ggccaccgga tacccccgcg acaagatgta 180 
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cggcgtgtgg tgggccggtg cggagcccga tgtgcgtgac gtgggcgaag gcgccaaggg 240 

ctacaacgcg ctggctctga acggctacgg cacgcagtcc aaggtgatcc angacatcct 300 

gaaacacgtg cacgacaagg gccagggcac ggggcccaaa gacgaagtgg gctcggtgct 360 

gtacacccgc ggcgtgatca tccagatgct ggacaaggtg tcaatcacta at 412 

<210> 165 
<211> 361 
<212> DNA 

<213> Homo sapiens 
<400> 165 

ttgacacctt gtccagcatc tgcatctgat gagagcctca gatggctacc actaatggca 60 
gaaggcaaag gagaacaggc attgtatggc aagaaaggaa gaaagagaga ggggagaaag 120 
gtgctaggtt cttttcaaca accagttctt gatggaactg agagtaagag ctcaaggcca 180 
ggtgtggtga ctccaaccag taatcccaac attttaggag gctgaggcag gcagatgtct 240 
tgaccccatg agtttgtgac cagcctgaac aacatcatga gactccatct ctacaataat 300 
tacaaaaatt aatcaggcat tgtggtatgc cctgtagtcc cagatgctgg acaaggtgtc 360 
a 361 

<210> 166 
<211> 427 
<212> DNA 

<213> Homo sapiens 
<400> 166 

twgactgact catgtcccct acacccaact atcttctcca ggtggccagg catgatagaa 60 
tctgatcctg acttagggga atattttctt tttacttccc atcttgattc cctgccggtg 120 
agtttcctgg ttcagggtaa gaaaggagct caggccaaag taatgaacaa atccatcctc 180 
acagacgtac agaataagag aacwtggacw tagccagcag aacmcaaktg aaamcagaac 240 
mcttaractag gatracaamc mcrraratar ktgcycmcmc wtataataga aaccaaactt 300 
gtatctaatt aaatatttat ccacygtcag ggcattagtg gttttgataa atacgctttg 360 
gctaggattc ctgaggttag aatggaaraa caattgcamc gagggtaggg gacatgagtc 420 
aktctaa 427 

<210> 167 
<211> 500 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<222> 288, 303, 318, 326 
<223> n = A,T,C or G 

<400> 167 

aacgtcgcat gctcccggcc gccatggccg cgggatagac tgactcatgt cccctaagat 60 
agaggagaca cctgctaggt gtaaggagaa gatggttagg tctacggagg ctccagggtg 120 
ggagtagttc cctgctaagg gagggtagac tgttcaacct gttcctgctc cggcctccac 180 
tatagcagat gcgagcagga gtaggagaga gggaggtaag agtcagaagc ttatgttgtt 240 
tatgcgggga aacgccrtat cgggggcagc cragttatta ggggacantr tagwyartcw 300 
agntagcatc caaagcgngg gagttntccc atatggttgg acctgcaggc ggccgcatta 360 
gtgattagca tgtgagcccc agacacgcat agcaacaagg acctaaactc agate ctgtg 420 
ctgattactt aacatgaatt attgtattta tttaacaact ttgagttatg aggcatatta 480 
ttaggtccat attacctgga 500 



<210> 168 
<211> 358 
<212> DNA 
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<400> 168 

ttcatcgctc 

tcacctgagg 

aaaaatacaa 

gctgaggcag 

gatcatgcca 

aaaaaaagaa 



ggtgactcaa 
ttgggagttt 
aaattagcca 
gagaatcact 
ctgcactcca 
tgatcagagc 



gcctgtaatc 
gagaccagcc 
agcatggtgg 
tgaggccagg 
gcctgggcaa 
cacaaataca 



ccagaacttt 
tggccaacat 
catgcacttg 
aggcagaggt 
cagagtaaga 
gaaaaccttg 



gggaggccga 
ggtgacaacc 
taatcccagc 
tgcagtgagg 
ctccatctca 
agtcaccgag 



ggggagcaga 60 
cgtctctgct 120 
tactcgggag 180 
cagaggttga 240 
aaaaaaaaaa 300 
cgatgaaa 358 



<210> 169 

<211> 1265 

<212> DNA 

<213> Homo sapiens 



<400> 169 

ttctgtccac 

aacatgtatt 

tgaatttcag 

tatttattta 

aggtagaaca 

actctaaaaa 

atattcactt 

ggaaaaactt 

argtttgggg 

gtatgggtca 

ccacatgtct 

tattaatttc 

atgggttgat 

cctgttttat 

attttgttgg 

gcagtttctt 

tgcatttcaa 

tttatttaca 

aagttattta 

atattttgta 

ttaaataaaa 

aaaaa 



accaatctta 
ttatggacca 
cacactgagt 
agattgattc 
aatactttct 
aacagagaac 
aaaaaaatga 
agtcaccctg 
gtggacattg 
gagaaaatga 
cagctatatt 
caaagggttt 
atttttcttt 
tttaaaactt 
ttttttactt 
taaccaatgt 
ctatgtcaat 
atatgtttaa 
tttctatggc 
cagttagtgg 
agacctgttt 



gagctctgaa 
aattgacatt 
tgggaatttc 
catactccgt 
attttttttt 
aaatatgtct 
tttcctgtgc 
aaaacccaca 
tatgtgttta 
atgcttagaa 
attatttatt 
taccctctat 
aaaagaaaaa 
tttatgttct 
tgttttttgt 
ctgtttggct 
ggtttcttaa 
agagataaca 
attccagcgg 
acagtattca 
gggatgtaaa 



agaatttgtc 
ttcgactatt 
ttatcccaga 
tttcaaggag 
caccattgtg 
cagttgtatt 
accttttggc 
aaataaataa 
aattaaaccc 
gctgttcaca 
ttttatgcat 
ttaaatgctt 
tataattatg 
gtggttgatg 
tttgttttgt 
aatgtaatta 
tatttattgt 
gtttgatatg 
atattttggt 
gcaacgcctg 
aaaaaaaaaa 



tttaaatatc 
ttttcccaaa 
agwcggcacg 
aatccctgca 
ggattggact 
aagcacggac 
aacttctctt 
aacttgtaga 
tgtatcactg 
tcttcaagag 
aaagtgaatc 
tgaaaaacag 
aaagccaaga 
ttgtttgttt 
tttggtttdg 
aagttgttaa 
gtagaagtac 
ttttcatgtg 
gtttgcgagg 
atagcttctt 
aaaaaaaaaa 



ttttaatagt 
aaaagtcagg 
agcaatttca 
gtctccttaa 
ttaagaggtg 
ccatattatc 
ttcaatgtag 
tgtgggcaga 
agaagctgtt 
cagaagcaaa 
atttcttctg 
tgcattgaca 
taatctgaag 
gtttgtttct 
catactacat 
tttatatgag 
tggtaatttt 
tttatagcag 
catgcagtca 
tggccttatg 
aaaaaaaaaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1265 



<210> 170 
<211> 383 
<212> DNA 
<213> Homo sapiens 



<400> 170 

tgtaagtcga 

tctgtgatac 

taattgatcc 

attttttcta 

aagacaaatg 

ataattatca 

aacatcacac 



gcagtgtgat 
tgatcctgag 
agagaacatg 
caaccattgt 
tgaaaaggat 
aacagcacag 
tgctcgactt 



gacgatattc 
ctaggaggcg 
ctggctacaa 
atgcatgttc 
aatatagttg 
ctacttgcct 
aca 



ttcttattaa 
ctgttcagtt 
ctaataaaac 
tcacagcacc 
gatcaaacaa 
taattttaga 



tgtggtaatt 
aatgggactt 
cgaaaaaagt 
acttttgacc 
aaacaacaca 
gttactcaca 



gaacaaatga 60 
cttcgtactc 120 
gaatttctaa 180 
aatacttcag 240 
atttgtcccg 300 
ttttgtgtgg 360 
383 



<210> 171 
<211> 383 
<212> DNA 
<213> Homo sapiens 
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<400> 171 

tgggcacctt caatatcgca agttaaaaat aatgttgagt ttattatact tttgacctgt 60 
ttagctcaac agggtgaagg catgtaaaga atgtggactt ctgaggaatt ttcttttaaa 120 
aagaacataa tgaagtaaca ttttaattac tcaaggacta cttttggttg aagtttataa 180 
tctagatacc tctacttttt gtttttgctg ttcgacagtt cacaaagacc ttcagcaatt 240 
tacagggtaa aatcgttgaa gtagtggagg tgaaactgaa atttaaaatt attctgtaaa 300 
tactataggg aaagaggctg agcttagaat cttttggttg ttcatgtgtt ctgtgctctt 360 
atcatcacac tgctcgactt aca 383 

<210> 172 
<211> 699 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 641 

<223> n = A,T,C or G 
<400> 172 

tcgggtgatg cctcctcagg cttgtcgtta gtgtacacag agctgctcat gaagcgacag 60 

cggctgcccc tggcacttca gaacctcttc ctctacactt ttggtgcgct tctgaatcta 120 

ggtctgcatg ctggcggcgg ctctggccca ggcctcctgg aaagtttctc aggatgggca 180 

gcactcgtgg tgctgagcca ggcactaaat ggactgctca tgtctgctgt catggagcat 240 

ggcagcagca tcacacgcct ctttgtggtg tcctgctcgc tggtggtcaa cgccgtgctc 300 

tcagcagtcc tgctacggct gcagctcaca gccgccttct tcctggccac attgctcatt 360 

ggcctggcca tgcgcctgta ctatggcagc cgctagtccc tgacaacttc caccctgatt 420 

ccggaccctg tagattgggc gccaccacca gatccccctc ccaggccttc ctccctctcc 480 

catcagcggc cctgtaacaa gtgccttgtg agaaaagctg gagaagtgag ggcagccagg 540 

ttattctctg gaggttggtg gatgaagggg tacccctagg agatgtgaag tgtgggtttg 600 

gttaaggaaa tgcttaccat cccccacccc caaccaagtt nttccagact aaagaattaa 660 

ggtaacatca atacctaggc ctgaggaggc atcacccga 699 

<210> 173 
<211> 701 
<212> DNA 
<213> Homo sapiens 

<400> 173 

tcgggtgatg cctcctcagg ccagatcaaa cttggggttg aaaactgtgc aaagaaatca 60 
atgtcggaga aagaattttg caaaagaaaa atgcctaatc agtactaatt taataggtca 120 
cattagcagt ggaagaagaa atgttgatat tttatgtcag ctattttata atcaccagag 180 
tgcttagctt catgtaagcc atctcgtatt cattagaaat aagaacaatt ttattcgtcg 240 
gaaagaactt ttcaatttat agcatcttaa ttgctcagga ttttaaattt tgataaagaa 300 
agctccactt ttggcaggag tagggggcag ggagagagga ggctccatcc acaaggacag 360 
agacaccagg gccagtaggg tagctggtgg ctggatcagt cacaacggac tgacttatgc 420 
catgagaaga aacaacctcc aaatctcagt tgcttaatac aacacaagct catttcttgc 480 
tcacgttaca tgtcctatgt agatcaacag caggtgactc agggacccag gctccatctc 540 
catatgagct tccatagtca ccaggacacg ggctctgaaa gtgtcctcca tgcagggaca 600 
catgcctctt cctttcattg ggcagagcaa gtcacttatg gccagaagtc acactgcagg 660 
gcagtgccat cctgctgtat gcctgaggag gcatcacccg a 701 

<210> 174 
<211> 700 
<212> DNA 
<213> Homo sapiens 



<220> 
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<221> misc_feature 
<222> 19 

<223> n - A,T,C or G 



<400> 174 

tcgggtgatg cctcctcang cccctaaatc 
ggaaagacat agattttaac cggccccctt 
tgcagtttga acagaggcag caaggctagt 
gcctggatct gcctcccagc tctgccagga 
gcagaaagcc ccctgtggac ccagtctcct 
accctttccc ttggtttggc ctcactttca 
tttcctgaaa ccttgtaaaa gaaaaaagtg 
tctacaaaaa atacaaaaat tagttgggtg 
tgggaggtgc tgaggtggga ggatcacttg 
agatcatgcc actgcactcc agcctgagta 
acaacaacag tgagtgtgcc tctgtttccg 
ctcagatttg gacgctgcag cctgaggagg 

<210> 175 
<211> 484 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<222> 30 

<223> n = A,T,C or G 



agagtccagg gtcagagcca caggagacag 60 
caggagattc tgaggctcag ttcactttgt 120 
ggttaggggc acggtctcta aagctgcact 180 
accagctgcg tggccttgag ctgctgacac 240 
cgtctgtaag atgaggacag gactctagga 300 
caggctccca tcttgaactc tatctactct 360 
ctagcctggg caacatggca aaaccctgtc 420 
tggtggcatg tgcctgtagt cccagccact 480 
agcccgggag gtggaggttg cagtgagcca 540 
atagagtaag actctgtctc aaaaacaaca 600 
ggttggatgg ggcaccacat ttatgcatct 660 
catcacccga 700 



<400> 175 

tatagggcga attgggcccg agttgcatgn 
gatgcctcct caggcttgtc tgccacaagc 
atgagggaaa atgtcctact gcactgcgaa 
ccttctaaac cagttaataa attcattcca 
gggactattc tcaggctgaa gaaggtggga 
gccagcttta tatttcaacc atggctggcc 
cctatcgggg catagcccag ggatgccccc 
cttgtcagtg atgacataca ccttagctgc 
ccga 

<210> 176 
<211> 432 
<212> DNA 

<213> Homo sapiens 



tcccggccgc catggccgcg ggattcgggt 60 
tacttctctg agctcagaaa gtgccccttg 120 
tttctcagbt ccattttacc tcccagtcct 180 
caagtattta ctgattacct gcttgtgcca 240 
ggggagggcg gaacctgagg agccacctga 300 
catctgagag catctcccca ctctcgccaa 360 
aggcggccca ggttagatgc gtccctttgg 420 
ttagctggtg ctggcctgag gaggcatcac 480 

484 



<400> 176 

tcgggtgatg cctcctcagg gctcaaggga 

tcatgccacc caggatgaaa atggataggg 

gacaaatgcc aggtagcgga attggtactg 

ccaccatggg acgtcatcgt tcaaatcaac 

ctcccacaca atcgcagttt ggagagatgg 

gccagctcta ccataaccag agtcagggac 

atatgccacc tcggttttct aagaaaggac 
ggcatcaccc ga 

<210> 177 
<211> 788 
<212> DNA 

<213> Homo sapiens 



tgagaagtga cttctttctg gagggaccgt 60 
acccacttgg aggacttgct gatatgtttg 120 
gtccaggagt tatccaggat agattttcac 180 
tcttcaatgg ccatggggga cacatcatgc 240 
gaggcaagtt tatgaaaagc caggggctaa 300 
tcttatccca gctgcaagga cagtcgaagg 360 
agcttaatgc agatgagatt agcctgagga 420 

432 
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<400> 177 

tagcatgttg 

catgctggag 

tggcaatgcc 

cccaactgac 

tactgttcct 

tgctccagtc 

tgaagctctt 

tcgtcgtggc 

cactgctcag 

attggattgc 

tcgccgttct 

cattgttgat 

tttgggtcgc 

acatgcta 



agcccagaca 
ccaagtgcta 
agtggaacca 
aagcccttgc 
gttggccgag 
aacgttacaa 
cctggggaca 
aacgttgctg 
gtgattatcc 
cacacggctc 
ggtaaaaagc 
atggttcctg 
tttgctgttc 



cagtagcatt 
acatgccttg 
cgctgcttga 
gcctgcctct 
tggagactgg 
cggaagtaaa 
atgtgggctt 
gtgacagcaa 
tgaaccatcc 
acattgcatg 
tggaagatgg 
gcaagcccat 
gtgatatgag 



tgtgccaatt 
gttcaaggga 
ggctctggac 
ccaggatgtc 
tgttctcaaa 
atctgtcgaa 
caatgtcaag 
aaatgaccca 
aggccaaata 
caagtttgct 
ccctaaattc 
gtgtgttgag 
acagacagtt 



tctggttgga 
tggaaagtca 
tgcatcctac 
tacaaaattg 
cccggtatgg 
atgcaccatg 
aatgtgtctg 
ccaatggaag 
agtgccggct 
gagctgaagg 
ttgaagtctg 
agcttctcag 
gcggtgggtg 



atggtgacaa 60 
cccgtaagga 120 
caccaactcg 180 
gtggtattgg 240 
tggtcacctt 300 
aagctttgag 360 
tcaaggatgt 420 
cagctggctt 480 
atgcccctgt 540 
aaaagattga 600 
gtgatgctgc 660 
actatccacc 720 
tctgggctca 780 
788 



<210> 178 
<211> 786 
<212> DNA 

<213> Homo sapiens 



<400> 178 

tagcatgttg 

cacttgggct 

attgttgttc 

tatttaatga 

agcctacaaa 

gtttcagagc 

cactgtgcac 

tacacacttt 

gtgtgtttag 

gacactttct 

gattgttaaa 

ccttctgtag 

attgaacagt 

atgcta 



agcccagaca 
gcactttgaa 
gtttcttttc 
cagaaacgtt 
aggcacttgt 
agccagtgat 
tcttctaggt 
atatatatgt 
tattttagct 
tgttgacacc 
gcttaattac 
tgtcgtaggg 
atggcctcat 



cctgtgtttc 
tgacacactt 
attaaaggtt 
ggtacatttc 
tataaatgaa 
tgttccagtc 
gtaagggtat 
atgtatgtat 
tagtgcaact 
ttgaatatta 
tatgatttgt 
ctcttacagg 
tatatatttt 



tgggagctct 
ggctttatta 
taatcagaca 
ttcatgaatg 
agttctggct 
agtgatgcct 
gcaactttgg 
gaaaacatga 
atttccacat 
atgttcaagg 
agtaaattaa 
gtgggaaaga 
gatttatagg 



ggcagtggcg 
gattcactag 
gatcagacag 
agcttgcatt 
ctagaggcca 
agttatatag 
atcttaaaat 
aattagtttg 
tatttattaa 
gtgcaatgtg 
cttttaaaat 
ttttaatttt 
agtttgtgtc 



gattcatagg 60 
tttttaaaaa 120 
cataattttg 180 
ctgaagcaag 240 
gtactctgga 300 
aggaggagta 360 
tctgtacaca 420 
tcaaatatgt 480 
attgatctaa 540 
tattccttta 600 
gtatttgagc 660 
ccagttgcta 720 
tgggctcaac 780 
786 



<210> 179 
<211> 796 
<212> DNA 

<213> Homo sapiens 



<400> 179 

tagcatgttg 

gcttttaaaa 

acacccccat 

ggagtatact 

cccaggccag 

aaataagttt 

ttttaatatc 

tggcaatccc 

gagtagccct 

atgagacctg 

aaagggctat 

tgtccttctt 

taattctaat 

tgggctcaac 



agcccagaca 
agccagtgaa 
ccgccccttg 
tctaattcct 
gtttccactc 
tcctttggag 
attggtatgt 
attaccatgc 
tcctggctac 
aagcacagac 
gtcaggtttc 
catccctagc 
ctatcagcaa 
atgcta 



ctggttacaa 
cctttttaat 
tttggagtgc 
gttgtcctgc 
atttattact 
gaatgtgatt 
gtttggccca 
tttttttaaa 
tggcttaatg 
tgtcttacca 
aatgctatct 
accagaattt 
aataactttt 



gaccagacct 
actttggcaa 
agagtttggc 
acaagctgaa 
ttatgtttct 
ataccccttt 
gaggaaactg 
aaacgtaatt 
tagtcactca 
caaaaggtga 
gcttctgttc 
cccagtctcc 
caaatgtttt 



gcttcctcca 
ccttctttca 
tttggttctt 
taccgagcta 
gttccattgc 
aatttcctcc 
aaattcacca 
tttcttgcct 
gtttctaggt 
caagatctca 
ctgctcactg 
ctccctacct 
aaccggtatc 



tatgtaaaca 60 
caggcaaaga 120 
tgccttgcct 180 
cccaccgcca 240 
tggtccacag 300 
ttttgctttt 360 
tcatcttgac 420 
tacattggca 480 
ggcattaggc 540 
aaccttagcc 600 
ttctggattt 660 
tcccttgttt 720 
tccatgtgtc 780 
796 
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<210> 180 
<211> 488 
<212> DNA 
<213> Homo sapiens 

<400> 180 

ggatgtgctg caaggcgatt aagttgggta 
aaaacgacgg ccagtgaatt gtaatacgac 
catgctcccg gccgccatgg ccgcgggata 
ttggagagat ttttcacgtt accagcttga 
actcccaagg tgaggttgaa gatttcctct 
gcctagaaaa tgattagcat gcaaatttct 
acattcagct gcttcttgtg aactgaaaag 
ctctaacatt gtaacacagt aatctgtgtg 
acatgcta 

<210> 181 
<211> 317 
<212> DNA 
<213> Homo sapiens 

<400> 181 

tagcatgttg agcccagaca cggcgacggt 
aaaggaggaa cgtcatcccc catgatattg 
tcaatgcata tttaatccat gatactgctg 
tgcaggttta tcgggactat tacctcacgg 
ctgtgtgtct agtaagggat gcacatgcag 
ctgggctcaa catgcta 

<210> 182 
<211> 507 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 493 

<223> n = A,T,C or G 
<400> 182 

tagcatgttg agcccagaca ctggctgtta 
tttggtgact caggattaca gaggcatcct 
agcagagagc accacataca ttagaatggt 
tgagggtggt ggtgaatggg aatggaagcc 
aatcctgtct tagtcttagg aaaggaagta 
tgtgtctggg ctcaacatgc tatcccgcgg 
cccaattcgc cctatagtga gtcgtattac 
gactgggaaa accctggcgt tacccaactt 
gctggcgtaa tancgaaaag gcccgca 

<210> 183 
<211> 227 
<212> DNA 
<213> Homo sapiens 

<400> 183 

gatttacgct gcaacactgt ggaggtagcc ctggagcaag gcaggcatgg atgcttctgc 60 



acgccagggt tttcccagtc acgacgttgt 60 

tcactatagg gcgaattggg cccgacgtcg 120 

gcatgttgag cccagacacc tgcaggtcat 180 

tggtcttttt caggaggaga gacactgagc 240 

agatagccgg ataagaagac taggagggat 300 

acctgccatt tcagaactgt gtgtcagccc 360 

agagaggtat tgagactttt ctgatggccg 420 

tgtgtgggtg tgtgtgtgtg tctgggctca 480 

488 



acctgatgag tggggtgatg gcacctgtga 60 
gggacccaga tgatgaacca tggctccgcg 120 
attggaagga cctgaacctg aagtttgtgc 180 
gtgatcaaaa cttcctgaag gacatgtggc 240 
tggccagtgt gccaggggta tggttggtgt 300 

317 



gccaaatcct ctctcagctg ctccctgtgg 60 

gtttcaggga acaaaaagat tttagctgcc 120 

aaggactgcc acctccttca agaacaggag 180 

tgcattccct gatgcatttg tgctctctca 240 

aagtttcaag gacggttccg aactgctttt 300 

ccatggcggc cgggagcatg cgacgtcggg 360 

aattcactgg ccgtcgtttt acaacgtcgt 420 

aatcgccttg cagcacatcc ccctttccca 480 

507 
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aatccccaaa tggagcctgg tatttcagcc aggaatctga gcagagcccc ctctaattgt 120 
agcaatgata agttattctc tttgttcttc aaccttccaa tagccttgag cttccagggg 180 
agtgtcgtta atcattacag cctggtctcc acagtgttgc agcgtaa 227 

<210> 184 
<211> 225 
<212> DNA 
<213> Homo sapiens 

<400> 184 

ttacgctgca acactgtgga gcagattaac atcagacttt tctatcaaca tgactggggt 60 

tactaaaaag acaacaaatc aatggcttca aaagtctaag gaataatttc gatacttcaa 120 

ctttataaaa cctgacaaaa ctatcaatca agcataaaga cagatgaaga acatttccag 180 

attttggcca atcagatatt ttacctccac agtgttgcag cgtaa 225 

<210> 185 
<211> 597 
<212> DMA 
<213> Homo sapiens 

<400> 185 

ggcccgacgt cgcatgctcc cggccgccat ggccgcggga ttcgttaggg tctctatcca 60 

ctgggaccca taggctagtc agagtattta gagttgagtt cctttctgct tcccagaatt 120 

tgaaagaaaa ggagtgaggt gatagagctg agagatcaga tttgcctctg aagcctgttc 180 

aagatgtatg tgctcagacc ccaccactgg ggcctgtggg tgaggtcctg ggcatctatt 240 

tgaatgaatt gctgaagggg agcactatgc caaggaaggg gaacccatcc tggcactggc 300 

acaggggtca ccttatccag tgctcagtgc ttctttgctg ctacctggtt ttctctcata 360 

tgtgaggggc aggtaagaag aagtgcccrg tgttgtgcga gttttagaac atctaccagt 420 

aagtggggaa gtttcacaaa gcagcagctt tgttttgtgt attttcacct tcagttagaa 480 

gaggaaggct gtgagatgaa tgttagttga gtggaaaaga cgggtaagct tagtggatag 540 

agaccctaac gaatcactag tgcggccgcc ttgcaggtcg accatatggg agagctc 597 

<210> 186 
<211> 597 
<212> DNA 
<213> Homo sapiens 

<400> 186 

ggcccgaagt tgcatgttcc cggccgccat ggccgcggga ttcgttaggg tctctatcca 60 
ctacctaaaa aatcccaaac atataactga actcctcaca cccaattgga ccaatccatc 120 
accccagagg cctacagatc ctcctttgat acataagaaa atttccccaa actacctaac 180 
tatatcattt tgcaagattt gttttaccaa attttgatgg cctttctgag cttgtcagtg 240 
tgaaccacta ttacgaacga tcggatatta actgcccctc accgtccagg tgtagctggc 300 
aacatcaagt gcagtaaata ttcattaagt tttcacctac taaggtgctt aaacacccta 360 
gggtgccatg tcggtagcag atcttttgat ttgtttttat ttcccataag ggtcctgttc 420 
aaggtcaatc atacatgtag tgtgagcagc tagtcactat cgcatgactt ggagggtgat 480 
aatagaggcc tcctttgctg ttaaagaact cttgtcccag cctgtcaaag tggatagaga 540 
ccctaacgaa tcactagtgc ggccgcctgc aggtcgacca tatgggagag ctcccaa 597 

<210> 187 
<211> 324 
<212> DNA 
<213> Homo sapiens 

<400> 187 

tcgttagggt ctctatccac ttgcaggtaa aatccaatcc tgtgtatatc ttatagtctt 60 

ccatatgtag tggttcaaga gactgcagtt ccagaaagac tagccgagcc catccatgtc 120 

ttccacttaa ccctgctttg ggttacacat cttaactttt ctgttcaagt ttctctgtgt 180 
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agtttatagc atgagtattg ggawaatgcc ctgaaacctg acatgagatc tgggaaacac 240 
aaacttactc aataagaatt tctcccatat ttttatgatg gaaaaatttc acatgcacag 300 
aggagtggat agagacccta acga 324 

<210> 188 
<211> 178 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<222> 46 

<223> n = A,T,C or G 
<400> 188 

gcgcggggat tcggggtgat acctcctcat gccaaaatac aacgtntaat ttcacaactt 60 
gccttccaat ttacgcattt tcaatttgct ctccccattt gttgagtcac aacaaacacc 120 
attgcccaga aacatgtatt acctaacatg cacatactct taaaactact catccctt 178 

<210> 189 
<211> 367 
<212> DNA 

<213> Homo sapiens 



<400> 189 

tgacaccttg 

atgaatttct 

agacacattc 

aatagctgtt 

aaaaaagctc 

atctcatacc 

ggtgtca 



tccagcatct 
ttagcaagtg 
agtgtccctg 
attcaattga 
aacatcactg 
agtcagaatg 



gacacagtct 
gtataagctg 
aaattagaat 
tggtaggcct 
atcattagaa 
gctattatta 



tggctcttgg 
agaatatacg 
aggacttaca 
taaaagtcaa 
aacttccatt 
aaaagtcaaa 



aaaatattgg 
tatcacatat 
ataagtgtgt 
agaaatgaga 
caaaccccca 
aaataacaga 



ataaatgaaa 60 
cctcattcta 120 
tcactttctc 180 
gggcatgtga 240 
atgagatacc 300 
tgctggacaa 360 
367 



<210> 190 
<211> 369 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<222> 323 

<223> n = A,T,C or G 



<400> 190 

gacaccttgt 

agtttttact 

aattgttcct 

catggggatg 

aaacagatga 

aagaaagcag 

ggtgcccaa 



ccagcatctg 
ctggctaggc 
gcaaggccta 
aacaagatag 
tatatacaaa 
agtcatgatt 



acaacgctaa 
agatggtggc 
tggatagagt 
gctacatcct 
tatataaatg 
tanaatgctg 



cagcctgagg 
taaaacattc 
attgtccagc 
gttcccacag 
aattcaggta 
gaaacagggg 



agatctttat 
atttacccat 
actgctctgg 
aacttccact 
gttttaagta 
ctattgcttg 



ttatttattt 60 
ttattcattt 120 
aagctaggag 180 
ttagtctggg 240 
cgaaaagaat 300 
agatattgaa 360 
369 



<210> 191 
<211> 369 
<212> DNA 

<213> Homo sapiens 



<400> 191 
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tgacaccttg tccagcatct gcacagggaa aagaaactat tatcagagtg aacaggcaac 60 

ctacagaatg ggagaaaatt tttgcaatct atccatctga caaagggcta atatccagaa 120 

tctacaaaga acttatacaa atttacaaga aacaaacaaa caaacaactc ctcaaaaagt 180 

gggtgaagga tgtgaacaga cacttctcaa aagaagacat ttatggggcc aacaaacata 240 

tgaaaaaaag ctcatcatca ctggtcacta gataaatgca aatcaaaacc acaatgagat 300 

accatctcat tccagttaga atggcaatca ttaaaaagtc aggaaacaac agatgctgga 360 
caaggtgtc 369 

<210> 192 
<211> 449 
<212> DNA 
<213> Homo sapiens 

<400> 192 

tgacgcttgg ccacttgaca cttcatcttt gcacagaaaa acttctttac agatttaatt 60 
caagactggt ctagtgacag tcctccagac attttttcat ttgttccata tacgtggaat 120 
tttaaaatca tgtttcatca gtttgaaatg atttgggctg ctaatcaaca caattggatc 180 
gactgttcta ctaaacaaca ggaaaatgtg tatctggcag cctgtggaga aacactaaac 240 
attgattttt ctttgccttt tacggacttt gttccagcta catgtaatac caagttctct 300 
ttaagaggag aagatgttga tcttcatttg tttctaccag actgccaccc tagtaaatat 360 
tctttattta tgctggtaaa aaattgccat ccaaataaga tgattcatga tactggtatt 420 
cctgctgagt gtcaagtggc caagcgtca 449 

<210> 193 

<211> 372 

<212> DNA 

<213> Homo sapiens 

<400> 193 

tgacgcttgg ccacttgaca ccagggatgt akcagttgaa tataatcctg caattgtaca 60 

tattggcaat ttcccatcaa acattctaga aagagacaac caggattgct aggccataaa 120 

agctgcaata aataactggt aattgcagta atcatttcag gccaattcaa tccagtttgg 180 

ctcagaggtg cctttggctg agagaagagg tgagatataa tgtgttttct tgcaacttct 240 

tggaagaata actccacaat agtctgagga ctagatacaa acctatttgc cattaaagca 300 

ccagagtctg ttaattccag tactgataag tgttggagat tagactccag tgtgtcaagt 360 
ggccaagcgt ca 372 

<210> 194 
<211> 309 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 140, 205 
<223> n = A,T f C or G 

<400> 194 

tgacgcttgg ccacttgaca cttatgtaga atccatcgtg ggctgatgca agccctttat 60 
ttaggcttag tgttgtgggc accttcaata tcacactaga gacaaacgcc acaagatctg 120 
cagaaacatt cagttctgan cactcgaatg gcaggataac tttttgtgtt gtaatccttc 180 
acatatacaa aaacaaactc tgcantctca cgttacaaaa aaacgtactg ctgtaaaata 240 
ttaagaaggg gtaaaggata ccatctataa caaagtaact tacaactagt gtcaagtggc 300 
caagcgtca 309 



<210> 195 
<211> 312 
<212> DNA 
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<213> Homo sapiens 
<220> 

<221> misc_feature 
<222> 100, 270 
<223> n « A,T,C or G 

<400> 195 

tgacgcttgg ccacttgaca cccaatctcg cacttcatcc tcccagcacc tgatgaagta 60 

ggactgcaac tatccccact tcccagatga ggggaccaan gtacacatta ggacccggat 120 

gggagcacag atttgtccga tcccagactc caagcactca gcgtcactcc aggacagcgg 180 

ctttcagata aggtcacaaa catgaatggc tccgacaacc ggagtcagtc cgtgctgagt 240 

taaggcaatg gtgacacgga tgcacgtgtn acctgtaatg gttcatcgta agtgtcaagt 300 

ggccaagcgt ca 312 

<210> 196 

<211> 288 

<212> DNA 

<213> Homo sapiens 

<400> 196 

tgtatcgacg tagtggtctc ctcagccatg cagaactgtg actcaattaa acctctttcc 60 
tttatgaatt acccaatctc gggtagtgtc tttatagtag tgtgagaatg gactaataca 120 
agtacatttt acttagtaat aataataaac aaatatatta catttttgtg tatttactac 180 
accatatttt ttattgttat tgtagtgtac accttctact tattaaaaga aataggcccg 240 
aggcgggcag atcacgaggt caggagatgg agaccactac gtcgatac 288 

<210> 197 
<211> 289 
<212> DNA 
<213> Homo sapiens 

<400> 197 

ttgggcacct tcaatatcat gacaggtgat gtgataacca agaaggctac taagtgatta 60 
atgggtgggt aatgtataca gagtaggtac actggacaga ggggtaattc atagccaagg 120 
caggagaagc agaatggcaa aacatttcat cacactactc aggatagcat gcagtttaaa 180 
acctataagt agtttatttt tggaattttc cacttaatat tttcagactg caggtaacta 240 
aactgtggaa cacaagaaca tagataaggg gagaccacta cgtcgatac 289 

<210> 198 

<211> 288 

<212> DNA 

<213> Homo sapiens 

<400> 198 

gtatcgacgt agtggtctcc caagcagtgg gaagaaaacg tgaaccaatt aaaatgtatc 60 

agatacccca aagaaaggcg cttgagtaaa gattccaagt gggtcacaat ctcagatctt 120 

aaaattcagg ctgtcaaaga gatttgctat gaggttgctc tcaatgactt caggcacagt 180 

cggcaggaga ttgaagccct ggccattgtc aagatgaagg agctttgtgc catgtatggc 240 

aagaaagacc ccaatgagcg ggactcctgg agaccactac gtcgatac 288 

<210> 199 

<211> 1027 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc feature 
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<222> 17, 21, 36, 39, 40, 42, 63, 98, 116, 145, 162, 173, 865, 
885, 891, 916, 924, 927, 929, 934, 942, 949, 976, 983, 988, 
989, 1009, 1014 
<223> n = A,T,C or G 

<400> 199 

gctttttggg aaaaacncaa ntgggggaaa gggggnttnn tngcaagggg ataaaggggg 60 
aancccaggg tttccccatt cagggaggtg taaaaagncg gccaggggat tgtaanagga 120 
ttcaataata gggggaatgg gcccngaagt tgcaaggttc cngcccgcca tgnccgcggg 180 
atttagtgac attacgacgs tggtaataaa gtgggsccaa waaatatttg tgatgtgatt 240 
tttsgaccag tgaacccatt gwacaggacc tcatttccty tgagatgrta gccataatca 300 
gataaaagrt tagaagtytt tctgcacgtt aacagcatca ttaaatggag tggcatcacc 360 
aatttcaccc tttgttagcc gataccttcc ccttgaaggc attcaattaa gtgaccaatc 420 
gtcatacgag aggggatggc atggggattg atgatgatat caggggtgat accttcacag 480 
gtgaaaggca tatcctcttg tctatactga ataccacaag tacccttttg accatgtcga 540 
ctagcaaatt tgtctccaat ctgtgtwatc cctaacagag cgtaccctta ttttacaaaa 600 
tttatatcct tcctgattga gagttaccat aacctgatcc acaatgcccg tctcgctwgt 660 
tctgagaaaa gtgctacagt ctctcttggt atagcgtcta ttggtgctct ccaattcatc 720 
ttcatttttc aggcaaggtg aactgttttg cctataataa cmtcatctcc tgatacmcga 780 
aacccckgga rctatcaaac catcatcatc cagcgttckt watgtymcta aatccctatt 840 
gcggccgcct gcaggtcaac atatnggaaa accccccacc ccttnggagc ntaccttgaa 900 
ttttccatat gtcccntaaa ttanctngnc ttancctggc cntaacctnt tccggtttaa 960 
attgtttccg cccccnttcc ccnccttnna accggaaacc ttaattttna accnggggtt 1020 
cctatcc 1027 

<210> 200 
<211> 207 
<212> DNA 

<213> Homo sapiens 
<400> 200 

agtgacatta cgacgctggc catcttgaat cctagggcat gaagttgccc caaagttcag 60 
cacttggtta agcctgatcc ctctggttta tcacaaagaa taggatggga taaagaaagt 120 
ggacacttaa ataagctata aattatatgg tccttgtcta gcaggagaca actgcacagg 180 
tatactacca gcgtcgtaat gtcacta 207 

<210> 201 
<211> 209 
<212> DNA 

<213> Homo sapiens 
<400> 201 

tgggcacctt caatatctat taaaagcaca aatactgaag aacacaccaa gactatcaat 60 
gaggttacat ctggagtcct cgatatatca ggaaaaaatg aagtgaacat tcacagagtt 120 
ttacttcttt gggaactcaa atgctagaaa agaaaagggt gccctctttc tctggcttcc 180 
tggtcctatc cagcgtcgta at gtcacta 209 

<210> 202 

<211> 349 

<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<222> 1 

<223> n = A,T,C or G 



<400> 202 
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ntacgctgca acactgtgga gccactggtt tttattcccg gcaggttatc cagcaaacag 60 
tcactgaaca caccgaagac cgtggtatgg taaccgttca cagtaatcgt tccagtcgtc 120 
tgcgggaccc cgacgagcgt cactgggtac agaccagatt cagccggaag agaaagcgcc 180 
gcagggagag actcgaactc cactccgctg gtgagcagcc ccatgttttc aactcgaagt 240 
tcaaacggca ttgggttata taccatcagc tgaacttcac acacatctcc ttgaacccac 300 
tggaaatcta ttttcttgtt ccgctcttct ccacagtgtt gcagcgtaa 349 

<210> 203 
<211> 241 
<212> DNA 
<213> Homo sapiens 

<400> 203 

tgctcctctt gccttaccaa cccaaagccc actgtgaaat atgaagtgaa tgacaaaatt 60 

cagttttcaa cgcaatatag tatagtttat ctgattcttt tgatctccag gacactttaa 120 

acaactgcta ccaccaccac caacctaggg atttaggatt ctccacagac cagaaattat 180 

ttctcctttg agtttcaggc tcctctggga ctcctgttca tcaatgggtg gtaaatggct 240 
a 241 

<210> 204 
<211> 248 
<212> DNA 
<213> Homo sapiens 

<400> 204 

tagccattta ccacccatct gcaaaccswg acmwwcargr cywgwackya ggcgatttga 60 

agtactggta atgctctgat catgttagtt acataagtgt ggtcagttta caaaaattca 120 

cagaactaaa tactcaatgc tatgtgttca tgtctgtgtt tatgtgtgtg taatgtttca 180 

attaagtttt tttaaaaaaa agagatgatt tccaaataag aaagccgtgt tggtaaggca 240 
agaggagc 248 

<210> 205 
<211> 505 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 447 

<223> n = A,T,C or G 
<400> 205 

tacgctgcaa cactgtggag ccattcatac aggtccctaa ttaaggaaca agtgattatg 60 
ctacctttgc acggttaggg taccgcggcc gttaaacatg tgtcactggg caggcggtgc 120 
ctctaatact ggtgatgcta gaggtgatgt ttttggtaaa caggcggggt aagatttgcc 180 
gagttccttt tacttttttt aacctttcct tatgagcatg cctgtgttgg gttgacagtg 240 
ggggtaataa tgacttgttg gttgattgta gatattgggc tgttaattgt cagttcagtg 300 
ttttaatctg acgcaggctt atgcggagga gaatgttttc atgttactta tactaacatt 360 
agttcttcta tagggtgata gattggtcca attgggtgtg aggagttcag ttatatgttt 420 
gggatttttt aggtagtggg tgttganctt gaacgctttc ttaattggtg gctgctttta 480 
rgcctactat gggtggtaaa tggct 505 

<210> 206 
<211> 179 
<212> DNA 
<213> Homo sapiens 



<400> 206 
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tagactgact catgtcccct accaaagccc atgtaaggag ctgagttctt aaagactgaa 60 
gacagactat tctctggaga aaaataaaat ggaaattgta ctttaaaaaa aaaaaaaatc 120 
ggccgggcat ggtagcacac acctgtaatc ccagctacta ggggacatga gtcagtcta 179 

<210> 207 

<211> 176 

<212> DNA 

<213> Homo sapiens 

<400> 207 

agactgactc atgtccccta ccccaccttc tgctgtgctg ccgtgttcct aacaggtcac 60 

agactggtac tggtcagtgg cctgggggtt ggggacctct attatatggg atacaaattt 120 

aggagttgga attgacacga tttagtgact gatgggatat gggtggtaaa tggcta 176 

<210> 208 
<211> 196 
<212> DNA 
<213> Homo sapiens 

<400> 208 

agactgactc atgtccccta tttaacaggg tctctagtgc tgtgaaaaaa aaaaatgctg 60 

aacattgcat ataacttata ttgtaagaaa tactgtacaa tgactttatt gcatctgggt 120 

agctgtaagg catgaaggat gccaagaagt ttaaggaata tgggtggtaa atggctaggg 180 

gacatgagtc agtcta 196 

<210> 209 
<211> 345 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 53, 56 

<223> n = A,T,C or G 

<400> 209 

gacgcttggc cacttgacac cttttatttt ttaaggattc ttaagtcatt tangtnactt 60 
tgtaagtttt tcctgtgccc ccataagaat gatagcttta aaaattatgc tggggtagca 120 
aagaagatac ttctagcttt agaatgtgta ggtatagcca ggattcttgt gaggaggggt 180 
gatttagagc aaatttctta ttctccttgc ctcatctgta acatggggat aataatagaa 240 
ctggcttgac aaggttggaa ttagtattac atggtaaata catgtaaaat gtttagaatg 300 
gtgccaagta tctaggaagt acttgggcat gggtggtaaa tggct 345 

<210> 210 
<211> 178 
<212> DMA 
<213> Homo sapiens 

<400> 210 

gacgcttggc cacttgacac tagagtaggg tttggccaac tttttctata aaggaccaga 60 

gagtaaatat ttcaggcttt gtgggttgtg cagtctctct tgcaactact cagctctgcc 120 

attgtagcat agaaatcagc catagacagg acagaaatga atgggtggta aatggcta 178 

<210> 211 
<211> 454 
<212> DNA 
<213> Homo sapiens 
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<400> 211 

tgggcacctt 

caccacttgc 

ttttgattcg 

gtagacagca 

atgttccagc 

tgtcctcttt 

tacttctgaa 

cttgttcaca 



caatatctat 
tgtttttgct 
atatcagcac 
tagtgtagag 
aacattaacg 
ttgttgtcaa 
ttcccattgg 
tcagtgtccc 



ccagcgcatc 
catgtatacc 
cgtataagag 
tggtatctcc 
cacattcatc 
ggacattaag 
cagaggccag 
tgagcataac 



taaattcgct 
aagtagcagt 
cagtgctttg 
atactcatct 
ttcctggcat 
ttgacatcgt 
atgtagagca 
ggaa 



tttttcttga 
ggtgtgaggc 
gccattaatt 
ggaatatttg 
tgtacggcct 
ctgtccagca 
gtcctctttt 



ttaaaaattt 60 
catgcttgtt 120 
tatcttcatt 180 
gatcagtgcc 240 
ttgtcagagc 300 
cgagttttac 360 
gcttgtccct 420 
454 



<210> 212 
<211> 337 
<212> DNA 

<213> Homo sapiens 



<400> 212 

tccgttatgc 

tttgcctcag 

tgatcacctg 

acgcagagaa 

gcattttgaa 

cattattgag 



cacccagaaa 
tcctatctga 
ggtttcttta 
actattaaag 
aacaaatttg 
gaaaattaat 



acctactgga 
ttcatgagca 
tttatcgact 
gtattcagaa 
ccgtggaaac 
atcacagcat 



gttacttatt 
catggttatt 
gtgtcatgac 
acgtgaagcc 
tttaatttgt 
aacggaa 



aacatcaagg 
actgatcgca 
aaggaaactt 
agcaattgtt 
tcttgaacag 



ctggaaccta 60 
ttgaaaacat 120 
acaaactgca 180 
tcgcaattcg 240 
tcaagaaaaa 300 
337 



<210> 213 
<211> 715 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<222> 552, 630, 649, 657, 691, 693, 697 
<223> n = A,T,C or G 



<400> 213 

tcgggtgatg 

tttttccttc 

agtatccttc 

aaaaaagtat 

atgtcggtta 

atgcatttga 

acatacagaa 

aagcaccttg 

aattgagctc 

tttgtcaatt 

ttttcatgct 

aatttgactc 



cctcctcagg 
tcttctttac 
ttgtcacctt 
acattagcac 
gctgaacaga 
gaatgcaagc 
atacattaag 
tatagttcct 
tcagttcaga 
tntttagtta 
gctatgaaag 
ccagttcaca 



catcttccat 
tgataaattt 
gcagacttta 
tattaagctt 
attcatttta 
attgtcaaat 
atattagaaa 
cttctaaaat 
ttttaggaga 
atctgtataa 
aaatacccan 
ggcctgagga 



ccatctcttc 
ggactccttc 
aacataaaaa 
ggccttgaaa 
caatgcagag 
aaacatttta 
gtgtttttgc 
tgaagtagat 
attttaacag 
ttttataaat 
gacagggtta 
ngnatcnccc 



aagattagct 
ttgacactga 
tactcattgg 
cattttctat 
tgagaaaaga 
aatgctttct 
ttgtgtacta 
tttaaaaacc 
ggatttggtt 
gtcaaactgt 
tttataaang 
gaaatcctta 



gtcccaaatg 60 
tgacagcttt 120 
ttttaaaagg 180 
cttttattaa 240 
agggagctat 300 
taaagtgagc 360 
ctaattaggg 420 
catgtaattt 480 
ttgtctaaat 540 
atttagtccg 600 
gaaagangtt 660 
ttgcg 715 



<210> 214 
<211> 345 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<222> 6, 8, 15 
<223> n - A,T,C or G 



<400> 214 
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ggtaangngc atacntcggt gctccggccg ccggagtcgg gggattcggg tgatgcctcc 60 
tcaggcccac ttgggcctgc ttttcccaaa tggcagctcc tctggacatg ccattccttc 120 
tcccacctgc ctgattcttc atatgttggg tgtccctgtt tttctggtgc tatttcctga 180 
ctgctgttca gctgccactg tcctgcaaag cctgcctttt taaatgcctc accattcctt 240 
catttgtttc ttaaatatgg gaagtgaaag tgccacctga ggccgggcac agtggctcac 300 
gcctgtaatc ccagcacttt gggagcctga ggaggcatca cccga 

<210> 215 

<211> 429 

<212> DNA 

<213> Homo sapiens 

<400> 215 

ggtgatgcct cctcaggcga agctcaggga ggacagaaac ctcccgtgga gcagaagggc e>u 
aaaagctcgc ttgatcttga ttttcagtac gaatacagac cgtgaaagcg gggcctcacg 120 
atccttctga ccttttgggt tttaagcagg aggtgtcaga aaagttacca cagggataac 180 
tggcttgtgg cggccaagcg ttcatagcga cgtcgctttt tgatccttcg atgtcggctc 240 
ttcctatcat tgtgaagcag aattcaccaa gcgttggatt gttcacccac taatagggaa 300 
cgtgagctgg gtttagaccg tcgtgagaca ggttagtttt accctactga tgatgtgtkg 360 
ttgccatggt aatcctgctc agtacgagag gaaccgcagg ttcasacatt tggtgtatgt 420 
gcttgcctt 

<210> 216 

<211> 593 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 15, 429, 446, 498, 512, 538, 543, 557 
<223> n = A,T,C or G 

<400> 216 

tgacacctat gtccngcatc tgttcacagt ttccacaaat agccagcctt tggccacctc 60 
tctgtcctga ggtatacaag tatatcagga ggtgtatacc ttctcttctc ttccccacca 120 
aagagaacat gcaggctctg gaagctgtct taggagcctt tgggctcaga atttcagagt 180 
cttgggtacc ttggatgtgg tctggaagga gaaacattgg ctctggataa ggagtacagc 240 
cggaggaggg tcacagagcc ctcagctcaa gcccctgtgc cttagtctaa aagcagcttt 300 
ggatgaggaa gcaggttaag taacatacgt aagcgtacac aggtagaaag tgctgggagt 360 
cagaattgca cagtgtgtag gagtagtacc tcaatcaatg agggcaaatc aactgaaaga 420 
agaagaccna ttaatgaatt gcttangggg aaggatcaag gctatcatgg agatctttct 480 
aggaagatta ttgtttanaa ttatgaaagg antagggcag ggacagggcc agaagtanaa 540 
ganaacattg cctatanccc ttgtcttgca cccagatgct ggacaaggtg tea 593 

<210> 217 
<211> 335 
<212> DNA 
<213> Homo sapiens 

<400> 217 ^ ^ ... Cfx 

tgacaccttg tccagcatct gaegtgaaga tgagcagctc agaggaggtg tcctggattt 60 
cctggttctg tgggctccgt ggcaatgaat tcttctgtga agtggatgaa gactacatcc 120 
aggacaaatt taatcttact ggactcaatg agcaggtccc tcactatcga caagctctag 180 
acatgatctt ggacctggag cctgatgaag aactggaaga caaccccaac cagagtgacc 240 
tgattgagca ggcagecgag atgctttatg gattgatcca cgcccgctac atccttacca 300 
accgtggcat cgcccagatg ctggacaagg tgtca 



<210> 218 
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<211> 248 
<212> DNA 
<213> Homo sapiens 

<400> 218 

tacgtactgg tcttgaaggt cttaggtaga gaaaaaatgt gaatatttaa tcaaagacta 6 0 

tgtatgaaat gggactgtaa gtacagaggg aagggtggcc cttatcgcca gaagttggta 120 

gatgcgtccc cgtcatgaaa tgttgtgtca ctgcccgaca tttgccgaat tactgaaatt 180 

ccgtagaatt agtgcaaatt ctaacgttgt tcatctaaga ttatggttcc atgtttctag 240 

tactttta 248 

<210> 219 
<211> 530 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_f eature 

<222> 49, 216, 265, 275, 281, 296, 371, 407, 424, 429, 454, 456, 
458, 464, 474, 476, 506, 509, 527, 530 
<223> n = A,T,C or G 

<400> 219 

tgacgcttgg ccacttgaca caagtagggg ataaggacaa agacccatna ggtggcctgt 60 

cagccttttg ttactgttgc ttccctgtca ccacggcccc ctctgtaggg gtgtgctgtg 120 

ctctgtggac attggtgcat tttcacacat accattctct ttctgcttca cagcagtcct 180 

gaggcgggag cacacaggac taccttgtca gatgangata atgatgtctg gccaactcac 240 

cccccaacct tctcactagt tatangaaga gccangccta naaccttcta tcctgncccc 300 

ttgccctatg acctcatccc tgttccatgc cctattctga tttctggtga actttggagc 360 

agcctggttt ntcctcctca ctccagcctc tctccatacc atggtanggg ggtgctgttc 420 

cacncaaang gtcaggtgtg tctggggaat cctnananct gccnggagtt tccnangcat 480 

tcttaaaaac cttcttgcct aatcanatng tgtccagtgg ccaaccntcn 530 

<210> 220 
<211> 531 
<212> DNA 
<213> Homo sapiens 

<400> 220 

tgacgcttgg ccacttgaca ctaaatagca tcttctaaag gcctgattca gagttgtgga 60 
aaattctccc agtgtcaggg attgtcagga acagggctgc tcctgtgctc actttacctg 120 
ctgtgtttct gctggaaaag gagggaagag gaatggctga tttttaccta atgtctccca 180 
gtttttcata ttcttcttgg atcctcttct ctgacaactg ttcccttttg gtcttcttct 240 
tcttgctcag agagcaggtc tctttaaaac tgagaaggga gaatgagcaa atgattaaag 300 
aaaacacact tctgaggccc agagatcaaa tattaggtaa atactaaacc gcttgcctgc 360 
tgtggtcact tttctcctct ttcacatgct ctatccctct atcccccacc tattcatatg 420 
gcttttatct gccaagttat ccggcctctc atcaaccttc tcccctagcc tactggggga 480 
tatccatctg ggtctgtctc tggtgtattg gtgtcaagtg gccaagcgtc a 531 

<210> 221 
<211> 530 
<212> DNA 
<213> Homo sapiens 

<400> 221 

attgacgctt ggccacttga cacccgcctg cctgcaatac tggggcaagg gccttcactg 60 

ctttcctgcc accagctgcc actgcacaca gagatcagaa atgctaccaa ccaagactgt 120 

tggtcctcag cctctctgag gagaaagagc agaagcctgg aagtcagaag agaagctaga 180 
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tcggctacgg ccttggcagc cagcttcccc acctgtggca ataaagtcgt gcatggctta 240 
acaatggggg cacctcctga gaaacacatt gttaggcaat tcggcgtgtg ttcatcagag 300 
catatttaca caaacctcga tagtgcagcc tactatccac tattgctcct acgctgcaaa 360 
cctgaacagc atgggactgt actgaatact ggaagcagct ggtgatggta cttatttgtg 420 
tatctaaaca cagagaaggt acagtaagaa tatggtatca taaacttaca gggaccgcca 480 
tcctatatgc agtctgttgt gaccaaaatg tgtcaagtgg ccaagcgtca 530 

<210> 222 
<211> 578 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<222> 308, 381, 561, 570, 573 

<223> n « A,T,C or G 

<400> 222 

tgtatcgacg tagtggtctc cgggctacta ggccgttgtg tgctggtagt acctggttca 60 
ctgaaaggcg catctccctc cccgcgtcgc cctgaagcag ggggaggact tcgcccagcc 120 
aaggcagttg tatgagtttt agctgcggca cttcgagacc tctgagccca cctccttcag 180 
gagccttccc cgattaagga agccagggta aggattcctt cctcccccag acaccacgaa 240 
caaaccacca ccccccctat tctggcagcc catatacatc agaacgaaac aaaaataaca 300 
aataaacnaa aaccaaaaaa aaaagagaag gggaaatgta tatgtctgtc catcctgttg 360 
ctttagcctg tcagctccta nagggcaggg accgtgtctt ccgaatggtc tgtgcagcgc 420 
cgactgcggg aagtatcgga ggaggaagca gagtcagcag aagttgaacg gtgggcccgg 480 
cggctcttgg gggctggtgt tgtacttcga gaccgctttc gctttttgtc ttagatttac 540 
gtttgctctt tggagtggga naccactacn tcnataca 578 

<210> 223 
<211> 578 
<212> DNA 
<213> Homo sapiens 

<400> 223 

tgtatcgacg tagtggtctc ctcttgcaaa ggactggctg gtgaatggtt tccctgaatt 60 

atggacttac cctaaacata tcttatcatc attaccagtt gcaaaatatt agaatgtgtt 120 

gtcactgttt catttgattc ctagaaggtt agtcttagat atgttacttt aacctgtatg 180 

ctgtagtgct ttgaatgcat tttttgtttg catttttgtt tgcccaacct gtcaattata 240 

gctgcttagg tctggactgt cctggataaa gctgttaaaa tattcaccag tccagccatc 300 

ttacaagcta attaagtcaa ctaaatgctt ccttgttttg ccagacttgt tatgtcaatc 360 

ctcaatttct gggttcattt tgggtgccct aaatcttagg gtgtgacttt cttagcatcc 420 

tgtaacatcc attcccaagc aagcacaact tcacataata ctttccagaa gttcattgct 480 

gaagcctttc cttcacccag cggagcaact tgattttcta caacttccct catcagagcc 540 
acaagagtat gggatatgga gaccactacg tcgataca 578 

<210> 224 
<211> 345 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<222> 13 

<223> n = A,T,C or G 



<400> 224 

tgtatcgacg tantggtctc ccaaggtgct gggattgcag gcatgagcca ccactcccag 60 
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gtggatcttt ttctttatac ttacttcatt aggtttctgt tattcaagaa gtgtagtggt 120 
aaaagtcttt tcaatctaca tggttaaata atgatagcct gggaaataaa tagaaatttt 180 
ttctttcatc tttaggttga ataaagaaac agaaaaaata gaacatactg aaaataatct 240 
aagttccaac catagaagaa ctgcagaaga aatgaagaaa gtgatgatga tttagatttt 300 
gatattgatt tagaagacac aggaggagac cactacgtcg ataca 345 

<210> 225 

<211> 347 

<212> DNA 

<213> Homo sapiens 

<400> 225 

tgtatcgacg tagtggtctc caaactgagg tatgtgtgcc actagcacac aaagccttcc 60 

aacagggacg caggcacagg cagtttaaag ggaatctgtt tctaaattaa tttccacctt 120 

ctctaagtat tctttcctaa aactgatcaa ggtgtgaagc ctgtgctctt tcccaactcc 180 

cctttgacaa cagccttcaa ctaacacaag aaaaggcatg tctgacactc ttcctgagtc 240 

tgactctgat acgttgttct gatgtctaaa gagctccaga acaccaaagg gacaattcag 300 

aatgctggtg tataacagac tccaatggag accactacgt cgataca 347 

<210> 226 
<211> 281 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 4, 6, 11 
<223> n = A,T,C or G 

<400> 226 

aggngnggga ntgtatcgac gtagtggtct cccaacagtc tgtcattcag tctgcaggtg 60 
tcagtgtttt ggacaatgag gcaccattgt cacttattga ctcctcagct ctaaatgctg 120 
aaattaaatc ttgtcatgac aagtctggaa ttcctgatga ggttttacaa agtattttgg 180 
atcaatactc caacaaatca gaaagccaga aagaggatcc tttcaatatt gcagaaccac 240 
gagtggattt acacacctca ggagaccact acgtcgatac a 281 

<210> 227 

<211> 3646 

<212> DNA 

<213> Homo sapiens 

<400> 227 

gggaaacact tcctcccagc cttgtaaggg ttggagccct ctccagtata tgctgcagaa 60 
tttttctctc ggtttctcag aggattatgg agtccgcctt aaaaaaggca agctctggac 120 
actctgcaaa gtagaatggc caaagtttgg agttgagtgg ccccttgaag ggtcactgaa 180 
cctcacaatt gttcaagctg tgtggcgggt tgttactgaa actcccggcc tccctgatca 240 
gtttccctac attgatcaat ggctgagttt ggtcaggagc accccttccg tggctccact 300 
catgcaccat tcataatttt acctccaagg tcctcctgag ccagaccgtg ttttcgcctc 360 
gaccctcagc cggttcggct cgccctgtac tgcctctctc tgaagaagag gagagtctcc 420 
ctcacccagt cccaccgcct taaaaccagc ctactccctt agggtcatcc catgtctcct 480 
cggctatgtc ccctgtaggc tcatcaccca ttgcctcttg gttgcaaccg tggtgggagg 540 
aagtagcccc tctactacca ctgagagagg cacaagtccc tctgggtgat gagtgctcca 600 
cccccttcct ggtttatgtc ccttctttct acttctgact tgtataattg gaaaacccat 660 
aatcctccct tctctgaaaa gccccaggct ttgacctcac tgatggagtc tgtactctgg 720 
acacattggc ccacctggga tgactgtcaa cagctccttt tgaccctttt cacctctgaa 780 
gagagggaaa gtatccaaag agaggccaaa aagtacaacc tcacatcaac caataggccg 840 
gaggaggaag ctagaggaat agtgattaga gacccaattg ggacctaatt gggacccaaa 900 
tttctcaagt ggagggagaa cttttgacga tttccaccgg tatctcctcg tgggtattca 960 
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gggagctgct 
tgagtcacca 
tgacctggca 
cccagatagt 
agcttttaga 
cagctcaggc 
t agate agee 
ctgactcaaa 
ctggccgacc 
acagtagcag 
cagatgtgtt 
tcgccctcac 
acgcctttgc 
cagcaggtgg 
ggtaaccaga 
aacttgetge 
acagaagaag 
ttcttcctga 
tacagtctac 
agatccccca 
aggtaaatgc 
caggagaaaa 
accttctagt 
aaactgtcaa 
ctgtttgcca 
agtaaggcgt 
caagtagaac 
ggtgtaaatt 
tgggctgggt 
ctaagagatg 
ccccaacagg 
cctgaacaga 
ttccagagag 
aeggctctga 
aacagagccc 
ctaagttggg 
cttctgtcaa 
caatgatttg 
taaccctgtt 
actctccctt 
ttcctcaagg 
gataagatac 
cccctctacc 
ggccctccac 
tagcagtaaa 



cagaaaccta 
ggagtgtttt 



gcccccgaaa 
aaaaggaaac 
gatagectaa 
agctgaaaaa 
tcatttgact 
ctccactatt 
tgatcttcaa 
cttcctcgag 
gtgggctcag 
tcaggctctc 
tactgtgcat 
ctgtaatcca 
aagctgattc 
ccacagtctc 
aaaactggcc 
ctctagaatc 
cacccattta 
tcttcaaagc 
caaaaaaggt 
gtgggaaatt 
actggtagac 
tatggtagtt 
tagggtctga 
taaacattca 
gcatgaactg 
gtgtaagtct 
tcttaccttt 
cccaattggc 
tacaagatat 
cagggccctg 
aaggactccc 
aggtggatgg 
aactagaaac 
tgaagecatt 
acttatgtat 
attcccccaa 
tttagactct 
agctaagagc 
acttaacttg 
tgtggcaagc 
cttcagcaac 
cagcaaaaag 
gttttttttt 



taaacttgtc 
tagagcacct 
atagecatge 
tccaaaaact 
aaggtttttg 
agecactgat 
tcccctccca 
cctgttcatg 
aatgtgcccc 
aagggactac 
gctttaccag 
cgatggggta 
gtacgtggag 
ctgtaaagga 
agcagctcaa 
ctttccacag 
tcagaactca 
ttcatacccc 
ggaggagcaa 
ctaac agate 
cctaaaccca 
gactttacag 
accttctctg 
aagtttttac 
taatggaccg 
atggaagctc 
caccctaaaa 
ccttccttta 
tgaaatcatg 
aaaaatatca 
catcctgcca 
ccattcattc 
tectgettgg 
cattcctgcg 
atgggtcccc 
agattaattc 
cttaagactc 
aaacacaagt 
ccctttcctt 
gccagatgga 
tgcaagctga 
tatatccgea 
caccaccctg 
attctgactc 
ctttttcttt 



66 



taaggegact 
ccaggaggct 
tcttaatttg 
agagggattt 
acagtcaaga 
aaagcatcct 
catggtgttt 
actgtcagga 
taggaaaggt 
gaaaggcegg 
caaacacctc 
aggatattaa 
ccatctacca 
catcaaaagg 
gatgcagtgt 
ccagatctgc 
gagecaataa 
gaactcttgg 
agctacctca 
aagcagctct 
gcccaggcca 
aagtaaaacc 
gatggactga 
tcaatgaaat 
gccttcgcct 
cattgtgcct 
aacactctta 
gccctactta 
tatgggaggg 
caaactaatt 
cttgttcgag 
ccgccaggtg 
aagagacctc 
tggattcatc 
agggctgggt 
tttttcttaa 
aatataaccc 
ggggaatgta 
taatcactca 
ctccatcttg 
ctcccagcac 
gttcccagga 
atcagtcagc 
actgaagact 
ctttttttct 



gaagtegtec 
tatcagattt 
gcatttgtgg 
tgctggaatg 
ggttgaaaaa 
ggagtatcag 
aaatccagct 
actgttggaa 
ggatgccacc 
tgcagctgtt 
agcacaaaag 
cgttaacact 
ggagcgtggg 
aaaacaegge 
gactttcagt 
ctgacaatcc 
aaatcaggaa 
gaaaacttta 
gctcctccgg 
ccggtgcaca 
ccgtctccaa 
acaccgggct 
ageatttget 
catccctcga 
tgtctatagt 
atcgacccca 
caaaattaat 
gagtaaggtg 
tgetgectat 
tattacagta 
gaacccatcc 
acctgttgtt 
acaccgtcat 
actcccgcat 
caggcccctt 
ttttgtaaaa 
ccttgttata 
gtgtccaacc 
gcttgtttcc 
gctctttcac 
atccaagaat 
attegtccaa 
agccatcagc 
tggatgatca 
cgtgcc 



aggggcatga 
acaccccttt 
ctcaggcagc 
aataccagtc 
caaaaacaag 
agtttactgt 
acactacttc 
actactgaaa 
atgttcacag 
accatggaga 
gctgaattga 
gacagcaggt 
ctactcacct 
tgttgcccgt 
cacgcctcta 
cgcatactca 
ggttggtgga 
atcagtcacc 
ageegtttta 
acctgcgccc 
gaaaactcac 
gggtacaaat 
accaaaaacg 
catgggctgc 
ttagtcagtc 
gagctctggg 
cttagaaacc 
caccccttac 
ettgectaag 
cctacagtct 
caatccaatt 
tgttaaaaag 
cacgatgcca 
caaaaaggee 
aaaactgeae 
caatgeatag 
actgaggaat 
tggtttttac 
acctgaattg 
tggcagccgc 
gcaattaact 
ttgatcacag 
accgaggcaa 
ttagtatttt 



1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3646 



<210> 228 
<211> 419 
<212> DNA 
<213> Homo sapiens 



<220> 

<221> misc_feature 
<222> 402 

<223> n = A,T,C or G 



<400> 228 

taagagggta caagatctaa gcacagccgt caatgeagaa cacagaacgt agcctggtaa 60 
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gtgtgttaag 
tggtgacggt 
ataaggatag 
ttcgcatgta 
agctgcaaat 
gtaaggaaat 



agtgggaatt 
cccagatggc 
acacaagtga 
tggaaactgc 
actaatttta 
aaccctacat 



tttggagtac 
ttacagaaga 
ggactgactt 
acgtacagga 
tcctgaaagt 
ttcagggtta 



agagtaaggc 
aagtgtcctg 
ggcagtggtg 
atgaagaatg 
tttgaagagt 
ttgtttgttt 



acctaaccct 
agatgagttt 
aatggtgggt 
agactgtgtg 
taactaaaaa 
anatattgaa 



agctggggtt 120 
ttaagaatga 180 
ggcaaaaaac 240 
gtgtttaatg 300 
gtatttttta 360 
ggtgcccaa 419 



<210> 229 
<211> 148 
<212> DNA 

<213> Homo sapiens 
<400> 229 

aagagggtac ctgtatgtag ccatggtggc aatgagagac tgattactac ctgctggaga 60 
ttgtttaagt gagttaatat attaaggata aagggagcca ggttttttga ctgttggaga 120 
aggaaattac agatattgaa ggtcccaa 148 

<210> 230 
<211> 257 
<212> DNA 

<213> Homo sapiens 
<400> 230 

taagagggta cmaaaaaaaa aaaatagaac gaatgagtaa gacctactat ttgatagtac 60 
aacagggtga ctatagtcaa tgataactta attatacatt taacatagag tgtaattgga 120 
ttgtttgtaa ctcgaaggat aaatgcttga gaggatggat accccattct ccatgatgta 180 
cttatttcac attacatgcc tgtatcaaag catctcatat accctataaa tatgtacacc 240 
tactatgtac cctctta 257 

<210> 231 
<211> 260 
<212> DNA 

<213> Homo sapiens 
<400> 231 

taagagggta cgggtatttg ctgatgggat ttttttttct ttctttttct ttggaaaaca 60 
aaatgaaagc cagaacaaaa ttattgaaca aaagacaggg actaaatctg gagaaatgaa 120 
gtcccctcac ctgactgcca tttcattcta tctgaccttc cagtctaggt taggagaata 180 
gggggtggag gggattaatc tgatacaggt atatttaaag caactctgca tgtgtgccag 240 
aagtccatgg taccctctta 260 

<210> 232 
<211> 596 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<222> 437, 440, 461, 536, 541, 565, 580, 587, 590, 595 
<223> n = A,T,C or G 



<400> 232 

tgctcctctt 

gtgggattaa 

atctgtgcac 

ggttcctcaa 

aaaagaactg 

attcataata 



gccttaccaa 
cattatttaa 
tgttggtggg 
aaaaaatttt 
aaatcaggat 
ctcaagagat 



ccacaaatta 
aaaatcagaa 
aatgtaaaaa 
ttttaatcta 
tttgaggaaa 
ggaaacaacc 



gaaccataat 
gtattgacaa 
aggtgtggcc 
ctctatgatc 
tattcacatt 
taaatgtcca 



gagatgtcac 
ggatgtgaag 
actatgggta 
gatcttgagg 
cccacatcca 
tcccgggatg 



ctcatacctg 60 
aaattagaac 120 
acagcatgaa 180 
ttgtttatgc 240 
tttctgcttt 300 
aatggataaa 360 
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cacagtgtgg tatatgcata caatggaata ttatttagtc tttaaaaaga aaaattctat 420 

catatactac aacttanatn aaccttgagg acacaatgct nagtgaaata agccacggaa 480 

ggacgaatac tgcattattc ccttatatga agtatctaaa gtggtcaaac tcttanagca 540 

naaagtaaaa atgggtggtt gccanacagt tggttaggcn agaaganaan cctant 596 

<210> 233 

<211> 96 

<212> DNA 

<213> Homo sapiens 

<400> 233 

tcttctgaag acctttcgcg actcttaagc tcgtggttgg taaggcaaga ggagcgttgg 60 
taaggcaaga ggagcgttgg taaggcaaga ggagca 96 

<210> 234 
<211> 313 
<212> DNA 
<213> Homo sapiens 

<400> 234 

tgtaagtcga gcagtgtgat gataaaactt gaatggatca atagttgctt cttatggatg 60 
agcaaagaaa gtagtttctt gtgatggaat ctgctcctgg caaaaatgct gtgaacgttg 120 
ttgaaaagac aacaaagagt ttagagtagt acataaattt agaatagtac ataaacttag 180 
aatagtacat aaacttagta cataaataat gcacgaagca ggggcagggc ttgagagaat 240 
tgacttcaat ttggaaagag tatctactgt aggttagatg ctctcaaaca gcatcacact 300 
gctcgactta caa 313 

<210> 235 
<211> 550 
<212> DNA 
<213> Homo sapiens 

<400> 235 

aacgaggaca gatccttaaa aagaatgttg agtgaaaaaa gtagaaaata agataatctc 60 
caaagtccag tagcattatt taaacatttt taaaaaatac actgataaaa attttgtaca 120 
tttcccaaaa atacatatgg aagcacagca gcatgaatgc ctatgggrtt gaggataggg 180 
gttgggagta gggatgggga taaaggggga aaataaaacc agagaggagt cttacacatt 240 
tcatgaacca aggagtataa ttatttcaac tatttgtacc wgaagtccag aaagagtgga 300 
ggcagaaggg ggagaagagg gcgaagaaac gtttttggga gaggggtccc asaagagaga 360 
ttttcgcgat gtggcgctac atacgttttt ccaggatgcc ttaagctctg caccctattt 420 
ttctcatcac taatattaga ttaaaccctt tgaagacagc gtctgtggtt tctctacttc 480 
agctttccct ccgtgtcttg cacacagtag ctgttttaca agggttgaac tgactgaagt 540 
gagattattc 550 

<210> 236 

<211> 325 

<212> DNA 

<213> Homo sapiens 

<400> 236 

tagactgact catgtcccct accagagtag ctagaattaa tagcacaagc ctctacaccc 60 
aggaactcac tattgaatac ataaatggaa tttattcagc cttaaaaagt ttggaaggaa 120 
attctgacat atgctaaaac atggatgaac cttgaagact ttatgataag taaaagaagc 180 
cagtcataaa aggaaaaata ttgcatgatt ccacttatat gaggtaccta gagtagtcaa 240 
tttcatagaa acacaaaata gaatggtgtt tgccagggct tttgaggaaa agggaatgac 300 
aagttagggg acatgagtca gtcta 325 



<210> 237 
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<211> 373 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 355 

<223> n = A,T,C or G 
<400> 237 

tagactgact catgtcccct atctactcaa catttccact tgaagtctga taggcatctc 60 
agacttatct tgtcccaaag caaactcttt atttcttttc atcctagtct ttatttcttg 120 
tgctgtctta cccatctcaa aagagtgcca aaatccacca agttgctgaa acagaaatct 180 
aagaaatatc cttgattctt ctttttccca tctacttcac ttctaattca ttagtaaata 240 
atctgtttca gaaaaccaaa cacctcatgt tctcactcat aagggggagt tgaacaatga 300 
gaacacacag acacagggag gggaacatca cacaccacgg cccgtcaggg agtangggac 360 
atgagtcagt eta 373 

<210> 238 
<211> 492 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 272, 310, 380, 435, 474, 484, 488 
<223> n = A,T,C or G 

<400> 238 

tagactgact catgtcccct ataatgetec caggcatcag aaagcatctc aaactggagc 60 
tgacaccatg gcagaggttt caggtaagtc acaaaagggg tcctaaagaa tttgccctca 120 
atatcagagt gattagaaga agtggacaga gctacccaag ttaaacatat gcgagataaa 180 
aaaaatatgg cacttgtgaa cacacactac aggaggaaaa taaggaacat aatagcatat 240 
tgtgctatta tgatgatgaa gaacctctct anaagaaaac ataaccaaag aaacaaagaa 300 
aattcctgen aatgtttaat gctatagaag aaattaacaa aaacatatat tcaatgaatt 360 
cagaaaagtt agcaggtcan aagaaaacaa atcaaagacc agaataatcc cattttagat 420 
tgtcgagtaa actanaacag aaagaatacc actggaaatt gaattcctac gtangggaca 480 
tgantcantc ta 492 

<210> 239 
<211> 482 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 245 

<223> n = A,T,C or G 
<400> 239 

tggaaagtat ttaatgatgg geaacttget gtttacttcc tacatatccc atcatcttct 60 
gtattttttt aaataacttt tttttggatt tttaaagtaa ccttattctg agaggtaaca 120 
tggattacat acttctaagc cattaggaga ctctatgtta aaccaaaagg aaatgttact 180 
agatcttcat ttgatcaata ggatgtgata atcatcatct ttctgetcta atggaaaagt 240 
actanaaaca tggaaccata atcttagatg aacaaegtta gaatttgeae taattctacg 300 
gaatttcagt aattcggcaa atgtcgggca gtgacacaac atttcatgac ggggaegcat 360 
ctaccaactt ctggcgataa gggccaccct tccctctgta cttacagtcc catttcatac 420 
acagtctttg attaaatatt cacatttttt ctctacctaa agaccttcaa gaccagtacg 480 
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ta 482 

<210> 240 
<211> 519 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 491 

<223> n « A,T,C or G 
<400> 240 

tgtatcgacg tagtggtctc cccatgtgat agtctgaaat atagcctcat gggatgagag 60 
gctgtgcccc agcccgacac ccgtaaaggg tctgtgctga ggtggattag taaaagagga 120 
aagccttgca gttgagatag aggaagggca ctgtctcctg cctgcccctg ggaactgaat 180 
gtctcggtat aaaacccgat tgtacatttg ttcaattctg agataggaga aaaaccaccc 240 
tatggcggga ggcgagacat gttggcagca atgctgcctt gttatgcttt actccacaga 300 
tgtttgggcg gagggaaaca taaatctggc ctacgtgcac atccaggcat agtacctccc 360 
tttgaactta attatgacac agattccttt gctcacatgt ttttttgctg accttctcct 420 
tattatcacc ctgctctcct accgcattcc ttgtgctgag ataatgaaaa taatatcaat 480 
aaaaacttga nggaactcgg agaccactac gtcgataca 519 

<210> 241 
<211> 771 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 304, 402, 442, 463, 510, 541, 550, 567, 571, 596, 617, 624, 
644, 648, 652, 667, 682, 686, 719, 722, 729, 732, 751, 752, 
757, 758, 760, 763, 766, 769 
<223> n = A,T,C or G 

<400> 241 

tgtatcgacg tagtggtctc cactcccgcc ttgacggggc tgctatctgc cttccaggcc 60 
actgtcacgg ctcccgggta gaagtcactt atgagacaca ccagtgtggc cttgttggct 120 
tgaagctcct cagaggaggg tgggaacaga gtgaccgagg gggcagcctt gggctgacct 180 
aggacggtca gcttggtccc tccgccaaac acgagagtgc tgctgcttgt atatgagctg 240 
cagtaataat cagcctcgtc ctcagcctgg agcccagaga tggtcaggga ggccgtgttg 300 
ccanacttgg agccagagaa gcgattagaa acccctgagg gccgattacc gacctcataa 360 
atcatgaatt tgggggcttt gcctgggtgc tgttggtacc angagacatt attataacca 420 
ccaacgtcac tgctggttcc antgcaggga aaatggttga tcnaactgtc caagaaaacc 480 
actacgtcca taccaatcca ctaattgccn gccgcctgca ggttcaacca tattggggaa 540 
naactccccn ccgccgtttg ggattgncat naacctttga aattttttcc tattanttgt 600 
ccccctaaaa taaaccnttg ggcnttaatc cattgggtcc atancttntt tncccggttt 660 
ttaaaanttg tttatcccgc cncccnattt cccccccaac tttccaaaac ccgaaaccnt 720 
tnaaatttnt tnaaaccctg gggggttccc nnaattnnan ttnaanctnc c 771 

<210> 242 
<211> 167 
<212> DNA 
<213> Homo sapiens 

<400> 242 

tgggcacctt caatatcggg ctcatcgata acatcacgct gctgatgctg ctgttgctgg 60 
tcctctctag gaacctctgg attttcaaat tctttgagga attcatccaa attatctgcc 120 
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tctcctcctt tcctcctttt tctaaggtct tctggtacaa gcggtca 167 

<210> 243 
<211> 338 
<212> DNA 
<213> Homo sapiens 

<400> 243 

ttgggcacct tcaatatcta ctgatctaaa tagtgtggtt tgaggcctct tgttcctggc 60 

taaaaatcct tggcaagagt caatctccac tttacaatag aggtaaaaat cttacaatgg 120 

atattcttga caaagctagc atagagacag caattttaca caaggtattt ttcacctgtt 180 

taataacagt ggttttccta cacccatagg gtgccaccaa gggaggagtg cacagttgca 240 

gaaacaaatt aagatactga agacaacact acttaccatt tcccgtatag ctaaccacca 300 

gttcaactgt acatgtatgt tcttatgggc aatcaaga 338 

<210> 244 
<211> 346 
<212> DNA 
<213> Homo sapiens 

<400> 244 

tttttggctc ccatacagca cactctcatg ggaaatgtct gttctaaggt caacccataa 60 
tgcaaaaatc atcaatatac ttgaagatcc ccgtgtaagg tacaatgtat ttaatattat 120 
cactgataca attgatccaa taccagtttt agtctggcat tgaatcaaat cactgttttt 180 
gttgtataaa aagagaaata tttagcttat atttaagtac catattgtaa gaaaaaagat 240 
gcttatcttt acatgctaaa atcatgatct gtacattggt gcagtgaata ttactgtaaa 300 
agggaagaag gaatgaagac gagctaagga tattgaaggt gcccaa 346 

<210> 245 

<211> 521 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 252, 337, 434, 455, 466, 478, 494, 510, 516 
<223> n = A,T,C or G 

<400> 245 

accaatccca cacggatact gagggacaag tat ate ate c catttcatcc ctacagcagc 60 

aacttcatga ggcaggagtt attagtccca ttttacagaa gaggaaactg agacttaggg 120 

agatcaagta atttgeccag gtcgcacaat tagtgataga gecagggett gaagcgacgt 180 

ctgtcttaag ccaatgaccc ctgeagatta ttagagcaac tgttctccac aacagtgtaa 240 

gcctcttgct anaagctcag gtccacaagg gcagagattt ttgtctgttt tgctcattgc 300 

tccttcccca ttgettagag cagggtctgc cacgaancag gttctcaatg catagttatt 360 

aaatgtatat aagagcaaac atatgttaca gagaactttc tgtatgcttg tcacttacat 420 

gaatcacctg tganatgggt atgcttgttc cccantgttg cagatnaaga tattgaangt 480 

gcccaaatca etanttgegg gcgcctgcan gtccancata t 521 

<210> 246 
<211> 482 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 464 

<223> n = A,T,C or G 
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<400> 246 

tggaaccaat ccaaataccc atcaatgata gactggataa agaaaatttg gcacatgttc 60 
accatgaaat actatgcagc cataaaaaag gatgagttca tatcctttgc agggacatgg 120 
atgaagctgg agaccatcat tctcagcaaa ctaacaaggg aacagaaaac caaacactgc 180 
atgttctcac tcttaagtgg gagctgaaca atgagaacac atggacacag ggaggggaac 240 
atcacacagt ggggcctgct ggtgggtagg ggtctagggg agggatagca ttaggagaaa 300 
tacctaatgt agatgacggg ttgatgggtg cagcaaacca ccatgacacg tgtataccta 360 
tgtaacaaac ctgcatgttc tgcacatgta ccccagaact taaagtgtta ataaaaaaat 420 
taagaaaaaa gttaagtatg tcatagatac ataaaatatt gtanatattg aaggtgccca 480 
aa 482 

<210> 247 
<211> 474 
<212> D1SIA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<222> 220, 255, 287, 312, 339, 374, 382, 403, 414, 426, 427, 428, 
432, 433, 434, 435, 436, 465 
<223> n = A,T,C or 6 

<400> 247 

ttcgatacag gcacagagta agcagaaaaa tggctgtggt ttaaccaagt gagtacagtt 60 
aagtgagaga ggggcagaga agacaagggc atatgcaggg ggtgattata acaggtggtt 120 
gtgctgggaa gtgagggtac tcggggatga ggaacagtga aaaagtggca aaaagtggta 180 
agatcagtga attgtacttc tccagaattt gatttctggn ggagtcaaat aactatccag 240 
tttggggtat catanggcaa cagttgaggt ataggaggta gaagtcncag tgggataatt 300 
gaggttatga anggtttggt actgactggt actgacaang tctgggttat gaccatggga 360 
atgaatgact gtanaagcgt anaggatgaa actattccac ganaaagggg tccnaaaact 420 
aaaaannnaa gnnnnngggg aatattattt atgtggatat tgaangtgcc caaa 474 

<210> 248 
<211> 355 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<222> 69, 87, 186, 192, 220, 227, 251, 278, 339, 346, 350 
<223> n = A,T,C or G 

<400> 248 

ttcgatacag gcaaacatga actgcaggag ggtggtgacg atcatgatgt tgccgatggt 60 

ccggatggnc acgaagacgc actggancac gtgcttacgt ccttttgctc tgttgatggc 120 

cctgagggga cgcaggaccc ttatgaccct cagaatcttc acaacgggag atggcactgg 180 

attgantccc antgacacca gagacacccc aaccaccagn atatcantat attgatgtag 240 

ttcctgtaga nggccccctt gtggaggaaa gctccatnag ttggtcatct tcaacaggat 300 

ctcaacagtt tccgatggct gtgatgggca tagtcatant taaccntgtn tcgaa 355 

<210> 249 
<211> 434 
<212> DNA 

<213> Homo sapiens 



<40O> 249 

ttggattggt cctccaggag aacaagggga aaaaggtgac cgagggctcc ctggaactca 60 
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aggatctcca ggagcaaaag gggatggggg aattcctggt cctgctggtc ccttaggtcc 120 
acctggtcct ccaggcttac caggtcctca aggcccaaag ggtaacaaag gctctactgg 180 
acccgctggc cagaaaggtg acagtggtct tccagggcct cctgggcctc caggtccacc 240 
tggtgaagtc attcagcctt taccaatctt gtcctccaaa aaaacgagaa gacatactga 300 
aggcatgcaa gcagatgcag atgataatat tcttgattac tcggatggaa tggaagaaat 360 
atttggttcc ctcaattccc tgaaacaaga catcgagcat atgaaatttc caatgggtac 420 
tcagaccaat ccaa 434 

<210> 250 
<211> 430 
<212> DNA 
<213> Homo sapiens 



<220> 

<221> misc_feature 
<222> 301, 430 
<223> n = A,T,C or G 



<400> 250 

tggattggtc acatggcaga gacaggattc 
tcactagtta ttattattta ttttattttt 
ggagagcggt ggtgcgatct tggctctctg 
tagcctcgcg ggtagatgga attacaggcg 
gtcttcagta gagacagggt ttcgccatgt 
nagtgatctg ccctcctcgg cctcacaaag 
ccagtcaact tctcactagt tatggcctta 
aaaaaaaaan 



caaggcagtg agaggaggat acaatgcttc 60 
gagatgaagt ctcgctttgt ctcccaggct 120 
caacccccgc ctcaagcaat tctcctgtct 180 
cccaccgcca tgcccaacta atttttttgt 240 
tgggcaggct ggtcttgaac tcctgacctc 300 
tgctggaatt acaggcatgg gctgctgcac 360 
tcattttcac cacattctat tggcccaaaa 420 

430 



<210> 251 
<211> 329 
<212> DNA 
<213> Homo sapiens 



<400> 251 

tggtactcca ccatyatggg gtcaaccgcc 

ggagtctgtg ccgaggtgca gctgrtgcag 

tctctgaaga tctcctgtaa gggttctgga 

gtgcgccagt tgcccgggaa aggcctggag 

gataccagat acagcccgtc cttccaaggc 

agcaccgcct atctgcagtg gagtaccaa 

<210> 252 
<211> 536 
<212> DNA 
<213> Homo sapiens 



atcctcgccc tcctcctggc tgttctccaa 60 
tctggagcag aggtgaaaaa gtccggggag 120 
tacaccttta agate tactg gatcgectgg 180 
tggatggggc tcatctttcc tgatgactct 240 
caggtcacca tctcagtcga taagtccatc 300 

329 



<400> 252 

tggtactcca ctcagcccaa ccttaattaa 

caggctcctc tgctctaacc aggcttctgg 

tatgtagatc ctgtactggc ctaagaagtt 

ttaatggtcg ttgagacttg tgtcctggag 

agaggaagaa ctgcctggaa gggggcatca 

ggcccccttc ccagctctca gectagagta 

tecttgetta gaatgtgcca ceggggggag 

agagtggcat cctatcttct gtgtgcccac 

gtttctggtc caggctttgc ccttgactca 



gaattaagag ggaacctatt actattctcc 60 
gacagtatta gaaaaggatg tctcaacaag 120 
aaactgagaa tagcataaat cagaccaaac 180 
cagctgggat aggaaaactt ttgggcagca 240 
tgttaaaaat tacaagggga acccacacca 300 
ttagcatttc tcagctagag actcacaact 360 
tccctgtggg tgatgaggct ctcaagagtg 420 
aggagectgg cccgagactt agcaggtgaa 480 
ctatgtgacc tctggtggag taccaa 536 



<210> 253 
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<211> 507 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 1 

<223> n = A,T,C or G 
<400> 253 

ntgttgcgat cccagtaact cgggaagctg aggcgggagg atcacctgag ctcaggaggt 60 
tgaggccgca gtgagccggg accacgccac tacactccag cctggggcat agagtgagac 120 
cctccaagac agaaaagaaa agaaaggaag ggaaagggaa agggaaaagg aaaaggaaaa 180 
ggaaaaggaa aaggaaaaga caagacaaaa caagacttga atttggatct cctgacttca 240 
attttatgtt ctttctacac cacaattcct ctgcttacta agatgataat ttagaaaccc 300 
ctcgttccat tctttacagc aagctggaag tttggtcaag taattacaat aatagtaaca 360 
aatttgaata ttatatgcca ggtgtttttc attcctgctc tcacttaatt ctcaccactc 420 
tgatataaat acaattgctg ccgggtgtgg tggctcatgc ctgtaatccc ggcactttgg 480 
gagaccgagg tgggcggats gcaacaa 507 

<210> 254 
<211> 222 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 167 

<223> n = A,T,C or G 
<400> 254 

ttggattggt cactgtgagg aagccaaatc ggatccgaga gtctttttct aaaggccagt 60 
actggccaca ctttctcctg ccgccttcct caaagctgaa gacacacaga gcaaggcgct 120 
tctgttttac tccccaatgg taactccaaa ccatagatgg ttagctnccc tgctcatctt 180 
tccacatccc tgctattcag tatagtccgt ggaccaatcc aa 222 

<210> 255 

<211> 463 

<212> DNA 

<213> Homo sapiens 

<400> 255 

tgttgcgatc cataaatgct gaaatggaaa taaacaacat gatgagggag gattaagttg 60 
gggagggagc acattaaggt ggccatgaag tttgttggaa gaagtgactt ttgaacaagg 120 
ccttggtgtt aagagctgat gagagtgtcc cagacagagg ggccactggt acaatagacg 180 
agatgggaga gggcttggaa ggtgtgcgaa ataggaagga gtttgttctg gtatgagtct 240 
agtgaacaca gaggcgagag gccctggtgg gtgcagctgg agagttatgc agaataacat 300 
taggccctgt gggggactgt agactgtcag caataatcca cagtttggat tttattctaa 360 
gagtgatggg aagccgtgga aagggggtta agcaaggagt gaaattatca gatttacagt 420 
gataaaaata aattggtctg gctactgggg aaaaaaaaaa aaa 463 

<210> 256 
<211> 262 
<212> DNA 
<213> Homo sapiens 



<400> 256 

ttggattggt caacctgctc aactctacyt ttcctccttc ttcctaaaaa attaatgaat 60 
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ccaatacatt aatgccaaaa cccttgggtt ttatcaatat ttctgttaaa aagtattatc 120 

cagaactgga cataatacta cataataata cataacaacc ccttcatctg gatgcaaaca 180 

tctattaata tagcttaaga tcactttcac tttacagaag caacatcctg ttgatgttat 240 

tttgatgttt ggaccaatcc aa 262 

<210> 257 
<211> 461 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<222> 2, 5, 6, 7, 8, 9, 10, 11, 12, 13, 25, 32, 38, 71, 72 
<223> n = A,T,C or G 

<400> 257 

gnggnnnnnn nnncaattcg actcngttcc cntggtancc ggtcgacatg gccgcgggat 
taccgcttgt nnctgggggt gtatggggga ctatgaccgc ttgtagctgg gggtgtatgg 
gggactatga ccgcttgtag mtggkggtgt atgggggact atgaccgctt gtcgggtggt 
cggataaacc gacgcaaggg acgtgatcga agctgcgttc ccgctctttc gcatcggtag 
ggatcatgga cagcaatatc cgcattcgyc tgaaggcgtt cgaccatcgc gtgctcgatc 
aggcgaccgg cgacatcgcc gacaccgcac gccgtaccgg cgcgctcatc cgcggtccga 
tcccgcttcc cacgcgcatc gagaagttca cggtcaaccg tggcccgcac gtcgacaaga 
agtcgcgcga gcagttcgag gtgcgtacct acaagcggtc a 

<210> 258 
<211> 332 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<222> 251 

<223> n = A,T,C or G 
<400> 258 

tgaccgcttg tagctggggg 
ggggactatg accgcttgta 
ggtgtatggg ggactaggac 
tagctggggg tgtatggggg 
accgcttgta nctgggggtg 
aggagagttg tggttgggga 

<210> 259 
<211> 291 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<222> 141, 144, 167, 168, 171, 175, 194, 201, 202, 205, 209, 212, 
235, 236, 245, 246, 258, 266, 268, 270, 273, 277, 285, 290 
<:223> n = A,T,C or G 

<400> 259 

taccgcttgt gaccgcttgt gaccgcttgt gaccgcttgt gaccgcttgt gaccgcttgt 60 
gaccgcttgt gaccgcttgt gaccgcttgt gaccgcttgt gaccgcttgt gaccgcttgt 120 
gaccgcttgt gaccgcttgt nacngggggt gtctggggga ctatgannga ntgtnactgg 180 



tgtatggggg actacgaccg 
gctgggggtg tatgggggac 
cgcttgtagc tgggggtgta 
actacgaccg cttgtagctg 
tatgggggac tatgaccgct 
aaaaaaaaaa aa 



cttgtagctg ggggtgtatg 60 
tatgaccgct tgtagctggg 120 
tgggggacta tgaccgcttg 180 
ggggtgtatg ggggactatg 240 
tgtgctgcct gggggatggg 300 

332 
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gggtgtctgg gggnctatga nngantgtna cngggggtgt ctgggggact atganngact 240 
gtgcnncctg ggggatcnga ggagantngn ggntagngat ggttngggan a 291 

<210> 260 
<211> 238 
<212> DNA 
<213> Homo sapiens 

<400> 260 

taagagggta ctggttaaaa tacaggaaat ctggggtaat gaggcagaga accaggatac 60 
tttgaggtca gggatgaaaa ctagaatttt tttctttttt tttgcctgag aaacttgctg 120 
ctctgaagag gcccatgtat taattgcttt gatcttcctt ttcttacagc cctttcaagg 180 
gcagagccct ccttatcctg aaggaatctt atccttagct atagtatgta ccctctta 238 

<210> 261 
<211> 746 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 662, 680, 685, 698, 707, 709, 734, 740, 741 
<223> n « A,T,C or G 

<400> 261 

ttgggcacct tcaatatcaa tagctaacat ttattgagtg tttatcgtat cataaaacac 60 
tgttctaagc ctttaaacgt actaattcat ttaatgctca taatcacttt agaaggtggg 120 
tactagtatt agtctcattt acagatgcaa catgcaggca cagagaggtt aattaacttg 180 
cccaaggtaa cacagctaag aaatagaaaa aatattgaat ctggaaagtt gggcttctgg 240 
gtaacccaca gagtcttcaa tgagcctggg gcctcactca gtttgctttt acaaagcgaa 300 
tgagtaacat cacttaattc agtgagtagg ccaaatggag gtcagctacg agtttctgct 360 
gttcttgcag tggactgaca gatgtttaca acgtctggcc atcagtwaat ggactgatta 420 
tcattgggaw gtgggtgggc tgaatgttgg ccagtgaagt ttattcawgc catattttta 480 
tgtttaggat gacttttggc tggtcctagg gcaagctctg tctgscacgg aacacagaat 540 
wacacaggga ccccctcaat ttctggtgtg gctagaacca tgaaccactg gttgggggaa 600 
caagcggtca aaacctaagt gcggccggct ggcagggtcc acccatatgg ggaaaactcc 660 
cnacgcgttt ggaatgcctn agctngaatt attctaanag ttgtccncnt aaaattagcc 720 
tgggcgttaa tcangggtcn naagcc 746 

<210> 262 
<211> 588 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 485, 488, 489, 492, 493, 494, 496, 497, 498, 499, 502, 503, 
504, 506, 521, 537, 550, 564 
<223> n - A,T,C or G 



<400> 262 

tgaccgcttg tcatctcaca tggggtcctg cacgcttttg cctttgtagg aaacctgaca 60 
tttgtctgtt tcttctttct cttttccttc ccatatcctc ctaatttacg tttgacttgt 120 
ttgctgagga ggcaggagct agagactgct gtgagctcat aggggtggga agtttatcct 180 
tcaagtcccg cccactcatc actgcttctc accttcccct gaccaggctt acaagtgggt 240 
tcttgcctgc tttccctttg gacccaacaa gcccctgtaa tgagtgtgca tgactctgac 300 
agctgtggac tcagggtcct tggctacagc tgccatgtaa aatatctcat ccagttctcg 360 
caaattgtta aaataaccac atttcttaga ttccagtacc caaatcatgt ctttacgaac 420 



WO030 13431 f http://www.getlh ep aten^ 245 of 308 



WO 03/013431 PCTYUS02/24917 

77 



tgctcctcac acccagaagt ggcacaataa ttcttgggga attattactt ttttttttct 480 
ctctnttnnc gnnngnnnng gnnngnccag gaattaccac nttggaagac ctggccngaa 540 
tttattatan aggggagccg attntttttc ctaacacaaa gcgggtca 588 

<210> 263 
<211> 730 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_ feature 

<222> 124, 510, 534, 559, 604, 605, 635, 711, 729 
<223> n = A,T,C or G 

<400> 263 

tttttttttt tttggcctga gcaactgaaa ttatgaaatt tccatatact caaaagagta 60 
agactgcaaa aagattaaat gtaaaagttg tcttgtatac agtaatgttt aagataccta 120 
ttanatttat aaatggaaaa ttagggcatt tggatataca agttgaaaat tcaggagtga 180 
ggttgggctg gctgggtata tactgaaaac tgtcagtaca cagatgacat ctaaaaccac 240 
aaatctggtt ttattttagc agtgatatgt gtcactccca caaaagcctt cccaattggc 300 
ctcagcatac acaacaagtc acctccccac agccctctac acataaacaa attccttagt 360 
ttagttcagg aggaaatgcg cccttttcct tccgctctag gtgaccgcaa ggcccagttc 420 
tcgtcaccaa gatgttaagg gaagtctgcc aaagaggcat ctgaaaggaa ataaggggaa 480 
tgggagtgac cacaaaggaa agccaaggan aaactttgga gaccgtttct aganccctgg 540 
catttcacaa caaaactcng gaacaaacct tgtctcatca atcatttaag cccttcgttt 600 
ggannagact ttctgaactg ggcgctgaac ataancctca ttgaatgtct tcacagtctc 660 
ccagctgaag gcacaccttg ggccagaagg ggaatcttcc aggtcctcaa nacagggctc 720 
gccctttgnc 730 

<210> 264 
<211> 715 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> miscfeature 

<222> 364, 451, 476, 494, 495, 515, 519, 524, 633, 635, 636, 645, 
647, 649, 657, 692, 695, 701, 707, 710, 713 
<223> n = A,T,C or G 

<400> 264 

tttttttttt tttggccagt atgatagtct ctaccactat attgaagctc ttaggtcatt 60 
tacacttaat gtggttatag atgctgttga gcttacttct accaccttgc tatttctccc 120 
gtctcttttt tgttcctttt ctcttctttt cctcccttat tttataattg aattttttag 180 
gattctattt tatatagatt tatcagctat aacactttgt attcttttgt tttgtggttc 240 
ttctgtcatt tcaatgtgca tcttaaactc atcacaatct attttcaaat aatatcatat 300 
aaccttacat ataatgtaag aatctaccac catatatttc catttctccc ttccatccta 360 
tgtntgtcat attttttcct ttatatatgt tttaaagaca taatagtata tgggaggttt 420 
ttgcttaaaa tgtgatcaat attccttcaa ngaaacgtaa aaattcaaaa taaatntctg 480 
tttattctca aatnnaccta atatttccta ccatntctna tacntttcaa gaatctgaag 540 
gcattggttt tttccggctt aagaacctcc tctaaagcac tctaagcaga attaagtctt 600 
ctgggagagg aattctccca agcttgggcc ttnanntgta ctccntnang gttaaanttt 660 
ggccgggaaa tagaaattcc aagttaacag gntanttttt ntttttnttn tcncc 715 

<210> 265 
<211> 152 
<212> DNA 
<213> Homo sapiens 



WO03013431 rhttp:/A*ww.qett^^^ 



WO 03/013431 PCT/US02/24917 

78 



<400> 265 

tttttttttt tttcccaaca caaagcacca ttatctttcc tcacaatttt caacatagtt 60 

tgattcccat gaagaggtta tgatttctaa agaaaacatg gctactatac tatcaatcag 120 

ggttaaatct tttttttttg agacggagtt ta 152 

<210> 266 
<211> 193 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 180 

<223> n = A,T,C or G 
<400> 266 

taaactccgt ccccttctta atcaatatgg aggctaccca ctccacatta ccttcttttc 60 
aagggactgt ttccgtaact gttgtgggta ttcacgacca ggcttctaaa cctcttaaaa 120 
ctccccaatt ctggtgccaa cttggacaac atgctttttt tttttttttt tttttttttn 180 
gagacggagt tta 193 

<210> 267 
<211> 460 
<212> DNA 
<213> Homo sapiens 

<400> 267 

tgttgcgatc ccttaagcat gggtgctatt aaaaaaatgg tggagaagaa aatacctgga 60 

atttacgtct tatctttaga gattgggaag accctgatgg aggacgtgga gaacagcttc 120 

ttcttgaatg tcaattccca agtaacaaca gtgtgtcagg cacttgctaa ggatcctaaa 180 

ttgcagcaag gctacaatgc tatgggattc tcccagggag gccaatttct gagggcagtg 240 

gctcagagat gcccttcacc tcccatgatc aatctgatct cggttggggg acaacatcaa 300 

ggtgtttttg gactccctcg atgcccagga gagagctctc acatctgtga cttcatccga 360 

aaaacactga atgctggggc gtactccaaa gttgttcagg aacgcctcgt gcaagccgaa 420 
tactggcatg acccataaaa ggaggatgtg gatcgcaaca 460 

<210> 268 
<211> 533 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 450, 470 
<223> n = A,T,C or G 

<400> 268 

tgttgcgatc cgttgataga atagcgacgt ggtaatgagt gcatggcacg cctccgactt 60 
accttcgccc gtggggaccc cgagtacgtc tacggcgtcg tcacttagag taccctctgg 120 
acgcccgggc gcgttcgatt taccggaagc gcgagctgca gtgggcttgc gcccccggcc 180 
aaattctttg gggggtttaa ggccgcgggg aatttgaggt atctctatca gtatgtagcc 240 
aagttggaac agtcgccatt cccgaaatcg ctttctttga atccgcaccg cctccagcat 300 
tgcctcattc atcaacctga aggcacgcat aagtgacggt tgtgtcttca gcagctccac 360 
tccataacta gcgcgctcga cctcgtcttc gtacgcgcca ggtccgtgcg tgcgaattcc 420 
caactccggt gagttgcgca tttcaagttn cgaaactgtt cgcctccacn atttggcatg 480 
ttcacgcatg acacggaata aactcgtcca gtaccgggaa tgggatcgca aca 533 
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<210> 269 

<211> 50 

<212> DNA 

<213> Homo sapiens 

<400> 269 

tttttttttt ttcgcctgaa ttagctacag atcctcctca caagcggtca 

<210> 270 
<211> 519 
<212> DNA 
<213> Homo sapiens 



50 



<400> 270 

tgttgcgatc 

gctgagtcac 

tgcacaccag 

tgtccaccac 

caaaagcgtc 

tcaccaggaa 

taatgggctc 

ccagagtctg 

ccttgggttg 



caaataaccc 
gtgaacggtc 
ctcagggctg 
ctcctggcac 
gagtatttct 
ttcacacacc 
caccagttcc 
catgggcatc 
catgtgcatc 



accagcttct 
agtgcaagca 
acctcctcca 
tcttccgaca 
ttctcagtct 
tcacagtaaa 
agggcaggga 
tctttcacct 
atcatctggg 



tgcacacttc 
gccgcgtgcc 
gcaggatgga 
gggacttcgg 
tgttgttgtc 
catcagactt 
tgacattctt 
catcacagaa 
atcgcaaca 



gcagaagcca 
agagcagagg 
caggatggag 
cagcttcgag 
aatcagcttg 
tgctgggacc 
ggaggccact 
cccaaccagc 



ccgtcctttg 60 
tgcagcatgc 120 
ctgccgtacg 180 
cacattttgt 240 
gtcacctcct 300 
tcgtgcttct 360 
ttggcgggga 420 
gcac agate t 480 
519 



<210> 271 
<211> 457 
<212> DNA 
<213> Homo sapiens 



<400> 271 

tttttttttt 

ccaatggccc 

gaacagcaca 

aaaagctggt 

attagtgaag 

ccaaataatg 

ggaacacctc 

ttctctgaag 



ttegggegge 
gctatgagga 
atggcaagac 
gccccgactg 
gatgtgtgtt 
acttcagaaa 
aaaaactggt 
attaagattt 



gaeeggaegt 
ggtgagcgtg 
cattttcgcc 
cgtgcaggct 
catctactgc 
aaacttgaaa 
agaatctgag 
taggatggca 



gcactcctcc 
tccggcttcg 
tactttaegg 
gaaccagtcg 
caagtaggag 
gtaacagcag 
tgtcttcagg 
atcaaga 



agtagegget 
aggagttcca 
gttctaagga 
tacgagaggg 
aagagectta 
tgcctacact 
ccaacctggt 



geaegtegtg 60 
ccgggccgtg 120 
cgccgggggg 180 
gctgaagcac 240 
ttggaaagat 300 
acttaagtat 360 
ggaaatgttg 420 
457 



<210> 272 
<211> 102 
<212> DNA 
<213> Homo sapiens 

<400> 272 

tttttttttt ttgggcaaca acctgaatac cttttcaagg ctctggcttg ggctcaagcc 60 
cgcaggggaa atgcaactgg ccaggtcaca gggcaatcaa ga 102 

<210> 273 
<211> 455 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 380, 415 r 454 
<223> n = A,T,C or G 
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<400> 273 

tttttttttt 

ggcaatcaac 

gtttaagtct 

ggccgaagtt 

tctcgtgttt 

ggcaatcaag 

gtttaagtct 

ggccgaagtt 



ttggcaatca 
aggtttaagt 
tcggccgaag 
aatctcgtgt 
ttggcaatca 
aggtttaagt 
tcggccgaan 
aatctcgtgt 



acaggtttaa 
cttcggccga 
ttaatctcgt 
ttttggcaat 
acaggtttaa 
cttcggccga 
ttaatctcgt 
ttttggcaat 



gtcttcggcc 
agttaatctc 
gtttttggca 
caacaggttt 
gtcttcggcc 
agttaatctc 
gtttttggca 
caana 



gaagttaatc 
gtgtttttgg 
atcaacaggt 
aagtcttcgg 
gaagttaatc 
gtgtttttgg 
atcaacaggt 



tcgtgttttt 60 
caatcaacag 120 
ttaagtcttc 180 
ccgaagttaa 240 
tcgtgttttt 300 
caatcaacag 360 
ttaantcttc 420 
455 



<210> 274 
<211> 461 
<212> DNA 
<213> Homo sapiens 



<400> 274 

tttttttttt 

tggcaaacca 

tccctgggat 

acagaaccaa 

aattcaacag 

caaaataata 

actggaagca 

attcaacata 



ttggccaata 
aatccagcag 
gcaaggctgg 
agacaaaaac 
cccttcatgc 
agagctattt 
ttccctttga 
gtattggaag 



cccttgatga 
cacatcaaaa 
ttcaacataa 
cacatgatta 
taaacactct 
atgacaaacc 
aaactggcac 
ttctggccag 



acatcaatgt 
agcttatcca 
gaaaatcaat 
tctcaataga 
taataaacta 
cacagccaat 
aagacaagga 
ggcaatcaag 



gaaaatcctc 
ccatgatcaa 
aaatgtaatc 
tgcagaaaag 
gatattgatg 
atcatactga 
tgccctctct 



ggtaaaatac 60 
gtgggcttca 120 
catcacataa 180 
gccttggaca 240 
gaatgtatct 300 
atgggcaaag 360 
caccgctcct 420 
461 



<210> 275 
<211> 729 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 164, 193, 207, 215, 216, 220, 223, 241, 244, 254, 269, 271, 
275, 290, 295, 298, 309, 318, 325, 326, 331, 352, 380, 401, 
411, 420, 424, 426, 431, 433, 435, 438, 440, 442, 443, 448, 
453, 464, 465, 468, 474, 475, 481, 487, 491, 503, 516 
<223> n = A,T,C or G 

<221> misc_feature 

<222> 519, 530, 531, 542, 547, 549, 559, 561, 564, 582, 586, 587, 
588, 589, 592, 595, 612, 614, 620, 631, 632, 635, 636, 644, 
646, 649, 650, 651, 655, 657, 660, 661, 662, 663, 666, 672, 
673, 674, 682, 687, 691, 693, 697, 700, 701, 704, 705 
<223> n = A,T,C or G 

<221> misc_feature 

<222> 713, 715, 717, 718, 722, 726, 727 
<223> n = A,T,C or G 



<400> 275 

tttttttttt 

catgaaaaca 

ctccccaaac 

cttgcaaatt 

ncantaagca 

ccctttctnt 

atttgggaga 

accngncttg 



ttggccaaca 
taggaaggtg 
cccaccttca 
canggatgtt 
gaantacgat 
cgtgttanga 
actccccccn 
ngngncantn 



ccaagtcttc 
gctgttacag 
cagcctcttc 
ggaagtngac 
gactttgana 
tctcnngtcc 
cgttggatcc 
cnncctcnca 



cacgtgggag 
caaacatttc 
cacacgtctc 
atttnnagtn 
nacanctgat 
ntcactaatg 
ccccttgagt 
ccntgtttcc 



gttttattat 
agatagacga 
ccanagattg 
gcnggaaccc 
gaagaacacn 
cggccccctg 
ntcccattct 
ctgnngtnaa 



gttttacaac 60 
atcggccaag 120 
ttgtccttca 180 
catcagtgaa 240 
ctacnganaa 300 
cnggtccacc 360 
ngtcccccan 420 
aatnngtttt 480 
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nccgccnccc naattcccac ccnaatcaca gcgaanccng aaggccttcn naagtgttta 540 
angcccngng gtttcctcnt ntanttgcag cctaccctcc cncttnnnnt tncgngttgg 600 
tcgcgccctg gncncgcctn gttcctcttt nngganacaa cctngntcnn nggcncntcn 660 
nnnctnttcc tnnnactagc tngcctntcc ncnccgnggn ncanngcaca ttncncnnac 720 
tntgtnncc 729 

<210> 276 
<211> 339 
<212> DNA 

<213> Homo sapiens 
<400> 276 

tgacctgaca tgtagtagat acttaataaa tatttgtgga atgaatggat gaagtggagt 60 
tacagagaaa aatagaaaag tacaaattgt tgtcagtgtt ttgaaggaaa attatgatct 120 
ttcccaaagt tctgacttca ttctaagaca gggttagtat ctccatacat aattttactt 180 
gcttttgaaa atcaaatgag ataatctatt tagattgata atttatttag actggctata 240 
aactattaag tgctagcaaa tatacatttt aatctcattt tccacctctt gtgatatagc 300 
tatgtaggtg ttgactttaa tggatgtcag gtcaatccc 339 

<210> 277 
<211> 664 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<222> 267, 534, 590, 601, 646, 657 
<223> n = A, T,C or G 

<400> 277 

tgacctgaca tccataacaa aatctttctc cattatattc ttctagggga atttcttgaa 60 
aagcatccaa aggaaacaaa tgatggtaag accgtgccaa gtggggagca gacaccaaag 120 
taagaccaca gattttacat tcaacaggta gctcacagta ctttgcccga cactgtgggc 180 
agaaatagcc tcctaatgta agccctggct cagtattgcc atccaaatgc gccatgctga 240 
aagagggttt tgcatcctgg tcagatnaag aagcaatggt gtgctgagga aatcccatac 300 
gaataagtga gcattcagaa cttgagctag caggaggagg actaagatga tgtgtgagca 360 
actctttgta atggctttca tctaaaataa catggtacgt gccaccagtt tcacgagcaa 420 
gtacagtgca aacgcgaact tctgcagaca atccaataac agatactcta attttagctg 480 
cctttagggt cttgattaaa tcataaatat tagatggatc gcaagttgta aggntgctaa 540 
aagatgatta gtacttctcg acttgtatgt ccaggcatgt tgttttaaan tctgccttag 600 
nccctgctta ggggaatttt taaagaagat ggctctccat gttcanggtc aatcacnaat 660 
tgcc 664 

<210> 278 
<211> 452 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<222> 430 

<223> n = A,T,C or G 
<400> 278 

tgacctgaca ttgaggaaga gcacacacct ctgaaattcc ttaggttcag aagggcattt 60 
gacacagagt gggcctctga taattcatga aatgcattct gaagtcatcc agaatggagg 120 
ctgcaatctg ctgtgctttg ggggttgcct cactgtgctc ctggatatca cacaaaagct 180 
gcaatccttc ttcttcaact aacattttgc agtatttgct gggattttta ctgcagacat 240 
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gatacatagc ccatagtgcc cagagctgaa cctctggttg agagaagttg ccaaggagcg 300 
ggaaaaatgt cttgaaagat ctataggtca ccaatgctgt catcttacaa cttgaacttg 360 
gccaattctg tatggttgca tgcagatctt ggagaagagt acgcctctgg aagtcacggg 420 
atatccaaan ctgtctgtca gatgtcaggt ca 452 

<210> 279 
<211> 274 
<212> DNA 
<213> Homo sapiens 

<400> 279 

tttttttttt ttcggcaagg caaatttact tctgcaaaag ggtgctgctt gcacttttgg 60 
ccactgcgag agcacaccaa acaaagtagg gaaggggttt ttatccctaa cgcggttatt 120 
ccctggttct gtgtcgtgtc cccattggct ggagtcagac tgcacaatct acactgaccc 180 
aactggctac tgtttaaaat tgaatatgaa taattaggta ggaaggggga ggctgtttgt 240 
tacggtacaa gacgtgtttg ggcatgtcag gtca 274 

<210> 280 
<211> 272 
<212> DNA 
<213> Homo sapiens 

<400> 280 

tacctgacat ggagaaataa cttgtagtat tttgcgtgca atggaatact atatgagggt 60 
gaaaatgaat gaactagcaa tgcgtgtatc aacatgaata aatccccaaa acataataat 120 
gttgaatgga aaaggtgagt ttcagaagga tatatatgcc ctctaaatcc atttatgtaa 180 
acctttaaaa aactacatta tttatggtca taagtccatc cagaaaatat ttaaaaacct 240 
acatgggatt gataactact gatgtcaggt ca 272 

<210> 281 
<211> 431 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc__feature 
<222> 339, 420, 430, 431 
<223> n = A,T,C or G 

<400> 281 

tttttttttt ttggccaata gcatgattta aacattggaa aaagtcaaat gagcaatgcg 60 
aatttttatg ttctcttgaa taatcaaaag agtaggcaac attggttcct cattcttgaa 120 
tagcattaat cagaaaatat tgcatagcct ctagcctcct tagagtaggt gtgctctctc 180 
aaatatatca tagtcccaca gtttatttca tgtatatttt ctgcctgaat cacatagaca 240 
tttgaatttg caacgcctga tgtaaatata taaattctta ccaatcagaa acatagcaag 300 
aaattcaggg acttggtcat yatcagggta tgacagcana tccctgtara aacactgata 360 
cacactcaca cacgtatgca acgtggagat gtcgcyttww kkktwywcwm rmrycrwcgn 420 
aatcacttan n 431 

<210> 282 

<211> 98 

<212> DNA 

<213> Homo sapiens 

<400> 282 

attcgattcg atgcttgagc ccaggagttc aagactgcag tgagccactg cacttcaggc 60 
tggacaacag agcgagtccc tgtgccaaaa aaaaaaaa 98 



W O0301 3431 [ http:/ /www. q etth e patent.com/Login.doq/$patents4/ @ 21 0 1 21 %2E419C12/FetchA/VO 0 301 3431 .cpc?fromCache=1 part^maintooibar^tPa gelSJ^^Og 



WO 03/013431 PCT/US02/24917 

83 



<210> 283 
<211> 764 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_f eature 

<222> 372, 374, 379, 380, 381, 382, 384, 387, 389, 392, 402, 409, 
411, 419, 421, 432, 440, 447, 452, 457, 466, 470, 471, 480, 
483, 492, 503, 506, 510, 512, 518, 520, 521, 524, 531, 534, 
536, 542, 545, 547, 550, 552, 553, 562, 566, 567, 575 
<223> n = A,T,C or G 

<221> misc_f eature 

<222> 580, 581, 584, 586, 587, 595, 598, 601, 603, 604, 606, 624, 
629, 630, 646, 651, 652, 653, 656, 659, 664, 665, 681, 691, 
700, 706, 709, 721, 724, 731, 732, 737, 741, 744, 745, 750, 
753, 754, 758 
<223> n = A,T,C or G 

<400> 283 

tttttttttt ttcgcaagca cgtgcacttt attgaatgac actgtagaca ggtgtgtggg 60 
tataaactgc tgtatctagg ggcaggacca agggggcagg ggcaacagcc ccagcgtgca 120 
gggccascat tgcacagtgg astgcaaagg ttgcaggcta tgggcggcta ctavtaaccc 180 
cgtttttcct gtattatctg taacataata tggtagactg tcacagagcc gaatwccart 240 
hacasgatga atccaawggt caygaggatg cccasaatca gggcccasat sttcaggcac 300 
ttggcggtgg gggcatasgc ctgkgccccg gtcacgtcsc caaccwtcty cctgtcccta 360 
cmcttgawtc cncnccttnn nntnccntna tntgcccgcc cncctcctng ngtcaaccng 420 
natctgcact anctccctcn ccccttntgg antctcntcc ttcaantaan nttatccttn 480 
acncccccct cncctttccc ctnccncccn tnatcccngn nccnctatca ntcntnccct 540 
cnctntnctn cnnatcgttc cncctnntaa ctacnctttn nacnanncct cactnatncc 600 
ngnnanttct ttccttccct cccnacgcnn tgcgtgcgcc cgtctngcct nnnctncgna 660 
cccnnacttt atttaccttt ncaccctagc nctctacttn acccanccnc tcctacctcc 720 
nggnccaccc nnccctnatc nctnnctctn tcnnctcntt cccc 764 

<210> 284 
<211> 157 
<212> DNA 
<213> Homo sapiens 

<400> 284 

caagtgtagg cacagtgatg aaagcctgga gcaaacacaa tctgtgggta attaacgttt 60 
atttctcccc ttccaggaac gtcttgcatg gatgatcaaa gatcagctcc tggtcaacat 120 
aaataagcta gtttaagata cgttccccta cacttga 157 

<210> 285 
<211> 150 
<212> DNA 
<213> Homo sapiens 

<400> 285 

attcgattgt actcagacaa caatatgcta agtggaagaa gtcagtcaca aaagaccaca 60 
tactgtatga cttcatttac attaagtgtc cagaataggc aaatccgtag agacagaaag 120 
tagatgagca gctgcctagg tctgagtaca 150 

<210> 286 
<211> 219 
<212> DNA 
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<213> Homo sapiens 
<400> 286 

attcgatttt tttttttttg gccatgatga aattcttact ccctcagatt ttttgtctgg 60 
ataaatgcaa gtctcaccac cagatgtgaa attacagtaa actttgaagg aatctcctga 120 
gcaaccttgg ttaggatcaa tccaatattc accatctggg aagtcaggat ggctgagttg 180 
caggtcttta caagttcggg ctggattggt ctgagtaca 219 

<210> 287 
<211> 196 
<212> DNA 

<213> Homo sapiens 
<400> 287 

attcgattct tgaggctacc aggagctagg agaagaggca tggaacaaat tttccctcat 60 
atccatactc agaaggaacc aaccctgctg acaccttaat ttcagcttct ggcctctaga 120 
actgtgagag agtacatttc tcttggttta agccaagaga atctgtcttt tggtacttta 180 
tatcatagcc tcaaga 196 

<210> 288 
<211> 199 
<212> DNA 

<213> Homo sapiens 
<400> 288 

attcgatttc agtccagtcc cagaacccac attgtcaatt actactctgt araagattca 60 
tttgttgaaa ttcattgagt aaaacattta tgatccctta atatatgcca attaccatgc 120 
taggtactga agattcaagt gaccgagatg ctagcccttg ggttcaagtg atccctctcc 180 
cagagtgcac tggactgaa 199 

<210> 289 
<211> 182 
<212> DNA 

<213> Homo sapiens 
<400> 289 

attcgattct tgaggctaca aacctgtaca gtatgttact ctactgaata ctgtaggcaa 60 
tagtaataca gaagcaagta tctgtatatg taaacattaa aaaggtacag tgaaacttca 120 
gtattataat cttagggacc accattatat atgtggtcca tcattggcca aaaaaaaaaa 180 
aa 182 

<210> 290 
<211> 1646 
<212> DNA 

<213> Homo sapiens 
<400> 290 

ggcacgagga gaaatgtaat tccatatttt atttgaaact tattccatat tttaattgga 60 
tattgagtga ttgggttatc aaacacccac aaactttaat tttgttaaat ttatatggct 120 
ttgaaataga agtataagtt gctaccattt tttgataaca ttgaaagata gtattttacc 180 
atctttaatc atcttggaaa atacaagtcc tgtgaacaac cactctttca cctagcagca 240 
tgaggccaaa agtaaaggct ttaaattata acatatggga ttcttagtag tatgtttttt 300 
tcttgaaact cagtggctct atctaacctt actatctcct cactctttct ctaagactaa 360 
actctaggct cttaaaaatc tgcccacacc aatcttagaa gctctgaaaa gaatttgtct 420 
ttaaatatct tttaatagta acatgtattt tatggaccaa attgacattt tcgactattt 480 
tttccaaaaa agtcaggtga atttcagcac actgagttgg gaatttctta tcccagaaga 540 
ccaaccaatt tcatatttat ttaagattga ttccatactc cgttttcaag gagaatccct 600 
gcagtctcct taaaggtaga acaaatactt tctatttttt tttcaccatt gtgggattgg 660 
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actttaagag 
gacccatatt 
cttttcaatg 
agatgtgggc 
ctgagaagct 
gagcagaagc 
atcatttctt 
cagtgcattg 
agataatctg 
tttgtttgtt 
kgcatactac 
atttatatga 
ctggtaattt 
gtttatagca 
gcatgcagtc 
ttggccttat 
aaaaaaaaaa 



gtgactctaa 
atcatattca 
tagggaaaaa 
agaaggtttg 
gttgtatggg 
aaaccacatg 
ctgtattaat 
acaatgggtt 
aagcctgttt 
tctattttgt 
atgcagttct 
gtgcatttca 
ttttatttac 
gaagttattt 
aatattttgt 
gttaaataaa 
aaaaaaaaaa 



aaaaacagag 
cttaaaaaaa 
cttagtcacc 
ggggtggaca 
tcagagaaaa 
tctcagctat 
ttccaaaggg 
gatatttttc 
tattttaaaa 
tggtttttta 
ttaaccaatg 
actatgtcaa 
aatatgttta 
atttctatgg 
acagttagtg 
aagacctgtt 
aaaaaa 



aacaaatatg 
tgatttcctg 
ctgaaaaccc 
ttgtatgtgt 
tgaatgctta 
attattattt 
ttttaccctc 
tttaaaagaa 
ctttttatgt 
ctttgttttt 
tctgtttggc 
tggtttctta 
aagagataac 
cattccagcg 
gacagtattc 
tgggatgtat 



tctcagttgt 
tgcacctttt 
acaaaataaa 
ttaaattaaa 
gaagctgttc 
attttttatg 
tatttaaatg 
aaatataatt 
tctgtggttg 
tgttttgttt 
taatgtaatt 
atatttattg 
agtttgatat 
gatattttgg 
agcaacgcct 
tttttatttt 



attaagcacg 
ggcaacttct 
taaaacttgt 
ccctgtatca 
acatcttcaa 
cataaagtga 
ctttgaaaaa 
atgaaagcca 
atgttgtttg 
tgttttgttt 
aaagttgtta 
tgtagaagta 
gttttcatgt 
tgtttgcgag 
gatagcttct 
taaaaaaaaa 



720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1646 



<210> 291 

<211> 1851 

<212> DNA 

<213> Homo sapiens 



<400> 291 

tcatcaccat 

cttccgtgtt 

tcacttcctt 

ttgctgtttt 

caaattacat 

aagatacatc 

cagcaagtat 

tgattaaaaa 

ggccatgctt 

atttatcttc 

ttggatcagt 

cctttgtcag 

gcacgagttt 

tttgcttgtc 

ggactttacc 

acctgggatc 

cgctcccctg 

cttcacagag 

gtccatccag 

cagccatcaa 

acagaggatg 

cacaggtact 

aagagatgaa 

aatataattt 

ccagtcgcag 

tgtgtttctt 

gctcctgaga 

tcacataaac 

tttgacaaaa 

cttttcccca 

aaggtatgtc 



tgccagcagc 
cttcattctt 
taagcctttg 
cagaagagat 
gatgatgact 
aacattttgc 
gagagcagtt 
tttcaccact 
gttttttgat 
attgtagaca 
gccatgttcc 
agctgtcctc 
tactacttct 
cctcttgttc 
ccaccaggca 
catgaaggcg 
cagcagggga 
gagtcgttgt 
ggaggaagaa 
acttctggac 
agatccagaa 
gaaatcatgt 
gacactgcag 
tcctctggag 
agaagccaca 
ccccagtgat 
aacaccccag 
agaattaaaa 
tccagcatcc 
tttagtatta 
ccttctatgc 



ggcaccgtta 
cttcaatagc 
tgactcttcc 
ttttaacatc 
agaaacagca 
tcaagtagag 
cttccatatc 
tgctgttttt 
tcgatatcag 
gcatagtgta 
agcaacatta 
tttttgttgt 
gaattcccat 
acatccgtgt 
gctctgtgga 
ctgtcatcgt 
agcagtggca 
ggtctccaga 
atgcaggaaa 
agcaggtcac 
accacaatat 
catctgcggc 
tatatctgca 
ccatatggat 
ctgaagctct 
gcagcctcaa 
ctcttccggt 
gcaaagtcac 
ttgtatttat 
tgttggctgt 
ctgttttgct 



gtcaggtttt 
cataaatctt 
tctgatgtca 
tgtttttctt 
tactctctgg 
ggctgactat 
tatccagcgc 
gctcatgtat 
caccgtataa 
gagtggtatt 
acgcacattc 
caaggacatt 
tggcagaggc 
ccctgagcat 
gcttgtccag 
agtctcccca 
gcaccacttg 
agtgcccacg 
tgaaagatgc 
ttccagcaag 
ccattcacaa 
aacatggtgg 
caacgtaata 
gaactatgaa 
gtcctcagcc 
gttatcccga 
ctaacacagg 
ataagcatct 
tgttgcagtt 
gggcttgtca 
gagggtttta 



ctgggaatcc cacatgagta 
ctagctctgg ctggctgttt 
gctttaagtc ttgttctgga 
tgtagtcaga aagtaactgg 
ccgtctttcc agatcttgag 
acttgctgat ccacaacata 
atttaaattc gcttttttct 
accaagtagc agtggtgtga 
gagcagtgct ttggccatta 
tccatactca tctggaatat 
atcttcctgg cattgtacgg 
aagttgacat cgtctgtcca 
cagatgtaga gcagtcctct 
gacgatgaga tcctttctgg 
atcttctcca tggacgtggt 
agcgaccacg ttgctcttgc 
cacctcttgc tcccaagcgt 
ttgctcttgc cgctccccct 
atgcacgatg gtatactcct 
gtggagaaag ctgtccaccc 
acaaacactt ttcagccaga 
aacctaccca atcacacatc 
c t c tt c a t c c at aac a a aat 



ggaagaactc 
atcagcgcca 
agctgccgca 
caagtcaata 
caacagacac 
ctcagaggaa 
taggtggttt 
attctcgtgc 



cccgaagaag 
cggacaggar 
gcacacggtg 
aatgtgataa 
agaaaaggca 
atgcttctaa 
ttattacttt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1851 



<210> 292 
<211> 1851 
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<212> DNA 

<213> Homo sapiens 

<400> 292 ^ 
tcatcaccat tgccagcagc ggcaccgtta gtcaggtttt ctgggaatcc cacatgagta 60 
cttccgtgtt cttcattctt cttcaatagc cataaatctt ctagctctgg ctggctgttt 120 
tcacttcctt taagcctttg tgactcttcc tctgatgtca gctttaagtc ttgttctgga 180 
ttgctgtttt cagaagagat ttttaacatc tgtttttctt tgtagtcaga aagtaactgg 240 
caaattacat gatgatgact agaaacagca tactctctgg ccgtctttcc agatcttgag 300 
aagatacatc aacattttgc tcaagtagag ggctgactat acttgctgat ccacaacata 360 
cagcaagtat gagagcagtt cttccatatc tatccagcgc atttaaattc gcttttttct 420 
tgattaaaaa tttcaccact tgctgttttt gctcatgtat accaagtagc agtggtgtga 480 
ggccatgctt gttttttgat tcgatatcag caccgtataa gagcagtgct ttggccatta 540 
atttatcttc attgtagaca gcatagtgta gagtggtatt tccatactca tctggaatat 600 
ttggatcagt gccatgttcc agcaacatta acgcacattc atcttcctgg cattgtacgg 660 
cctttgtcag agctgtcctc tttttgttgt caaggacatt aagttgacat cgtctgtcca 720 
gcacgagttt tactacttct gaattcccat tggcagaggc cagatgtaga gcagtcctct 780 
tttgcttgtc cctcttgttc acatccgtgt ccctgagcat gacgatgaga tcctttctgg 840 
ggactttacc ccaccaggca gctctgtgga gcttgtccag atcttctcca tggacgtggt 900 
acctgggatc catgaaggcg ctgtcatcgt agtctcccca agcgaccacg ttgctcttgc 960 
cgctcccctg cagcagggga agcagtggca gcaccacttg cacctcttgc tcccaagcgt 1020 
cttcacagag gagtcgttgt ggtctccaga agtgcccacg ttgctcttgc cgctccccct 1080 
gtccatccag ggaggaagaa atgcaggaaa tgaaagatgc atgcacgatg gtatactcct 1140 
cagccatcaa acttctggac agcaggtcac ttccagcaag gtggagaaag ctgtccaccc 1200 
acagaggatg agatccagaa accacaatat ccattcacaa acaaacactt ttcagccaga 1260 
cacaggtact gaaatcatgt catctgcggc aacatggtgg aacctaccca atcacacatc 1320 
aagagatgaa gacactgcag tatatctgca caacgtaata ctcttcatcc ataacaaaat 1380 
aatataattt tcctctggag ccatatggat gaactatgaa ggaagaactc cccgaagaag 1440 
ccagtcgcag agaagccaca ctgaagctct gtcctcagcc atcagcgcca cggacaggar 1500 
tgtgtttctt ccccagtgat gcagcctcaa gttatcccga agctgccgca gcacacggtg 1560 
gctcctgaga aacaccccag ctcttccggt ctaacacagg caagtcaata aatgtgataa 1620 
tcacataaac agaattaaaa gcaaagtcac ataagcatct caacagacac agaaaaggca 1680 
tttgacaaaa tccagcatcc ttgtatttat tgttgcagtt ctcagaggaa atgcttctaa 1740 
cttttcccca tttagtatta tgttggctgt gggcttgtca taggtggttt ttattacttt 1800 
aaggtatgtc ccttctatgc ctgttttgct gagggtttta attctcgtgc c 1851 

<210> 293 
<211> 668 
<212> DNA 
<213> Homo sapiens 

<400> 293 ^ CA 

cttgagcttc caaataygga agactggccc ttacacasgt caatgttaaa atgaatgcat 60 
ttcagtattt tgaagataaa attrgtagat ctataccttg ttttttgatt cgatatcagc 120 
accrtataag agcagtgctt tggccattaa tttatctttc attrtagaca gcrtagtgya 180 
gagtggtatt tccatactca tctggaatat ttggatcagt gccatgttcc agcaacatta 240 
acgcacattc atcttcctgg cattgtacgg cctgtcagta ttagacccaa aaacaaatta 300 
catatcttag gaattcaaaa taacattcca cagctttcac caactagtta tatttaaagg 360 
agaaaactca tttttatgcc atgtattgaa atcaaaccca cctcatgctg atatagttgg 420 
ctactgcata cctttatcag agctgtcctc tttttgttgt caaggacatt aagttgacat 480 
cgtctgtcca gcaggagttt tactacttct gaattcccat tggcagaggc cagatgtaga 540 
gcagtcctat gagagtgaga agacttttta ggaaattgta gtgcactagc tacagccata 600 
gcaatgattc atgtaactgc aaacactgaa tagcctgcta ttactctgcc ttcaaaaaaa 660 

668 

aaaaaaaa 



<210> 294 
<211> 1512 
<212> DNA 
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<213> Homo sapiens 
<400> 294 

gggtcgccca gggggsgcgt gggctttcct cgggtgggtg tgggttttcc ctgggtgggg 60 
tgggctgggc trgaatcccc tgctggggtt ggcaggtttt ggctgggatt gacttttytc 120 
ttcaaacaga ttggaaaccc ggagttacct gctagttggt gaaactggtt ggtagacgcg 180 
atctgttggc tactactggc ttctcctggc tgttaaaagc agatggtggt tgaggttgat 240 
tccatgccgg ctgcttcttc tgtgaagaag ccatttggtc tcaggagcaa gatgggcaag 300 
tggtgctgcc gttgcttccc ctgctgcagg gagagcggca agagcaacgt gggcacttct 360 
ggagaccacg acgactctgc tatgaagaca ctcaggagca agatgggcaa gtggtgccgc 420 
cactgcttcc cctgctgcag ggggagtggc aagagcaacg tgggcgcttc tggagaccac 480 
gacgaytctg ctatgaagac actcaggaac aagatgggca agtggtgctg ccactgcttc 540 
ccctgctgca gggggagcrg caagagcaag gtgggcgctt ggggagacta cgatgacagt 600 
gccttcatgg agcccaggta ccacgtccgt ggagaagatc tggacaagct ccacagagct 660 
gcctggtggg gtaaagtccc cagaaaggat ctcatcgtca tgctcaggga cactgacgtg 720 
aacaagaagg acaagcaaaa gaggactgct ctacatctgg cctctgccaa tgggaattca 780 
gaagtagtaa aactcstgct ggacagacga tgtcaactta atgtccttga caacaaaaag 840 
aggacagctc tgayaaaggc cgtacaatgc caggaagatg aatgtgcgtt aatgttgctg 900 
gaacatggca ctgatccaaa tattccagat gagtatggaa ataccactct rcactaygct 960 
rtctayaatg aagataaatt aatggccaaa gcactgctct tatayggtgc tgatatcgaa 1020 
tcaaaaaaca aggtatagat ctactaattt tatcttcaaa atactgaaat gcattcattt 1080 
taacattgac gtgtgtaagg gccagtcttc cgtatttgga agctcaagca taacttgaat 1140 
gaaaatattt tgaaatgacc taattatctm agactttatt ttaaatattg ttattttcaa 1200 
agaagcatta gagggtacag tttttttttt ttaaatgcac ttctggtaaa tacttttgtt 1260 
gaaaacactg aatttgtaaa aggtaatact tactattttt caatttttcc ctcctaggat 1320 
ttttttcccc taatgaatgt aagatggcaa aatttgccct gaaataggtt ttacatgaaa 1380 
actccaagaa aagttaaaca tgtttcagtg aatagagatc ctgctccttt ggcaagttcc 1440 
taaaaaacag taatagatac gaggtgatgc gcctgtcagt ggcaaggttt aagatatttc 1500 
tgatctcgtg cc 1512 

<210> 295 

<211> 1853 

<212> DNA 

<213> Homo sapiens 

<400> 295 

gggtcgccca gggggsgcgt gggctttcct cgggtgggtg tgggttttcc ctgggtgggg 60 
tgggctgggc trgaatcccc tgctggggtt ggcaggtttt ggctgggatt gacttttytc 120 
ttcaaacaga ttggaaaccc ggagttacct gctagttggt gaaactggtt ggtagacgcg 180 
atctgttggc tactactggc ttctcctggc tgttaaaagc agatggtggt tgaggttgat 240 
tccatgccgg ctgcttcttc tgtgaagaag ccatttggtc tcaggagcaa gatgggcaag 300 
tggtgctgcc gttgcttccc ctgctgcagg gagagcggca agagcaacgt gggcacttct 360 
ggagaccacg acgactctgc tatgaagaca ctcaggagca agatgggcaa gtggtgccgc 420 
cactgcttcc cctgctgcag ggggagtggc aagagcaacg tgggcgcttc tggagaccac 480 
gacgaytctg ctatgaagac actcaggaac aagatgggca agtggtgctg ccactgcttc 540 
ccctgctgca gggggagcrg caagagcaag gtgggcgctt ggggagacta cgatgacagy 600 
gccttcatgg akcccaggta ccacgtccrt ggagaagatc tggacaagct ccacagagct 660 
gcctggtggg gtaaagtccc cagaaaggat ctcatcgtca tgctcaggga cackgaygtg 720 
aacaagargg acaagcaaaa gaggactgct ctacatctgg cctctgccaa tgggaattca 780 
gaagtagtaa aactcstgct ggacagacga tgtcaactta atgtccttga caacaaaaag 840 
aggacagctc tgayaaaggc cgtacaatgc caggaagatg aatgtgcgtt aatgttgctg 900 
gaacatggca ctgatccaaa tattccagat gagtatggaa ataccactct rcactaygct 960 
rtctayaatg aagataaatt aatggccaaa gcactgctct tatayggtgc tgatatcgaa 1020 
tcaaaaaaca agcatggcct cacaccactg ytacttggtr tacatgagca aaaacagcaa 1080 
gtsgtgaaat ttttaatyaa gaaaaaagcg aatttaaaat gcrctggata gatatggaag 1140 
ractgctctc atacttgctg tatgttgtgg atcagcaagt atagtcagcc ytctacttga 1200 
gcaaaatrtt gatgtatctt ctcaagatct ggaaagacgg ccagagagta tgctgtttct 1260 
agtcatcatc atgtaatttg ccagttactt tctgactaca aagaaaaaca gatgttaaaa 1320 
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atctcttctg 
caaaggctta 
ttttggttta 
cctatgagac 
gcggtgtctc 
tcaggagatc 
aaacttagct 
ggagaatggc 
ccagcctggg 



aaaacagcaa 
aaggaagtga 
atgttttttt 
taggctttga 
acgcctgtaa 
gagaccatcc 
gggtgtggtg 
atgaacccgg 
tgacagagca 



tccagaacaa 
aaacagccag 
tttttgcctt 
gaatcaatag 
ttccagcacc 
tggctaacac 
gcgggtgcct 
gaggtggagg 
agactctgtc 



gacttaaagc 
ccagaggcat 
aataatatta 
attctttttt 
ttgagaggct 
ggtgaaaccc 
gtagtcccag 
ttgcagtgag 
tcaaaaaaaa 



tgacatcaga 
ggaaactttt 
gatagtccca 
taagaatctt 
gaggtgggca 
catctctact 
ctactcagga 
ccgagatccg 
aaaaaaaaaa 



ggaagagtca 
aaatttaaac 
aatgaaatwa 
ttggctagga 
gatcacgaga 
aaaaatacaa 
rgctgaggca 
ccactacact 
aaa 



1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1853 



<210> 296 

<211> 2184 

<212> DNA 

<213> Homo sapiens 



<400> 296 
ggcacgagaa 
aaaaccacct 
tttcctctga 
tgtgtctgtt 
ttattgactt 
ctgcggcagc 
gtggcgctga 
ggagttcttc 
tggatgaaga 
ttgggtaggt 
aaagtgtttg 
gctttctcca 
ccatcgtgca 
ggcaagagca 
agcaagaggt 
acgtggtcgc 
atggagaaga 
atctcatcgt 
ctctacatct 
gatgtcaact 
gccaggaaga 
atgagtatgg 
aagcactgct 
tgctacttgg 
cgaatttaaa 
gate age aag 
tggaaagacg 
ttctgactac 
agacttaaag 
gecagaggea 
taataatatt 
gattcttttt 
cttgagaggc 
eggtgaaace 
tgtagtccca 
gttgcagtga 
ctcaaaaaaa 



ttaaaaccct 
atgacaagee 
gaactgeaac 
gagatgetta 
gcctgtgtta 
ttcgggataa 
tggctgagga 
cttcatagtt 
gtattacgtt 
tccaccatgt 
tttgtgaatg 
ccttgctgga 
tgeatcttte 
acgtgggcac 
gcaagtggtg 
ttggggagac 
tctggacaag 
catgetcagg 
ggcctctgcc 
taatgtcctt 
tgaatgtgcg 
aaataccact 
ettataeggt 
tatacatgag 
tgcgctggat 
tatagtcagc 
gecagagagt 
aaagaaaaac 
ctgacatcag 
tggaaacttt 
agatagtccc 
ttaagaatct 
tgaggtgggc 
ccatctctac 
gctactcagg 
gccgagatcc 
aaaaaaaaaa 



cagcaaaaca 

cacagccaac 

aataaataca 

tgtgactttg 

gaeeggaaga 

cttgaggctg 

cagagcttca 

catccatatg 

gtgeagatat 

tgeegcagat 

gatattgtgg 

agtgacctgc 

atttcctgea 

ttctggagac 

ctgccactgc 

tacgatgaca 

ctccacagag 

gaeaeggatg 

aatgggaatt 

gacaacaaaa 

ttaatgttgc 

ctacactatg 

gctgatatcg 

caaaaacagc 

agatatggaa 

cctctacttg 

atgctgtttc 

agatgttaaa 

aggaagagtc 

taaatttaaa 

aaatgaaatw 

tttggctagg 

agate ac gag 

taaaaataca 

argctgaggc 

gccactacac 

aaaa 



ggcatagaag 

ataatactaa 

aggatgetgg 

cttttaattc 

gctggggtgt 

catcactggg 

gtgtggcttc 

gctccagagg 

actgcagtgt 

gacatgattt 

tttctggatc 

tgtccagaag 

tttcttcctc 

cacaacgact 

ttcccctgct 

gcgccttcat 

ctgcctggtg 

tgaacaagag 

cagaagtagt 

agaggacagc 

tggaacatgg 

ctgtctacaa 

aatcaaaaaa 

aagtggtgaa 

gaactgetet 

agcaaaatgt 

tagtcatcat 

aatctcttct 

acaaaggctt 

cttttggttt 

acctatgaga 

agcggtgtct 

atcaggagat 

aaaacttagc 

aggagaatgg 

tccagcctgg 



ggacatacct 
atggggaaaa 
attttgtcaa 
tgtttatgtg 
ttctcaggag 
gaagaaacac 
tetgegactg 
aaaattatat 
cttcatctct 
cagtacctgt 
tcatcctctg 
tttgatggct 
cctggatgga 
cctctgtgaa 
gcaggggagc 
ggatcccagg 
gggtaaagtc 
ggaeaagcaa 
aaaactcgtg 
tctgacaaag 
cactgatcca 
tgaagataaa 
caagcatggc 
atttttaatc 
catacttget 
tgatgtatct 
catgtaattt 
gaaaacagca 
aaaggaagtg 
aatgtttttt 
ctaggctttg 
cacgcctgta 
cgagaccatc 
tgggtgtggt 
catgaacccg 
gtgacagagc 



taaagtaata 60 
gttagaagca 120 
atgectttte 180 
attatcacat 240 
ccaccgtgtg 300 
aytcctgtcc 360 
gettcttegg 420 
tattttgtta 480 
tgatgtgtga 540 
gtctggctga 600 
tgggtggaca 660 
gaggagtata 720 
cagggggagc 780 
gacgcttggg 840 
ggcaagagca 900 
taccacgtcc 960 
cccagaaagg 1020 
aagaggactg 1080 
ctggacagac 1140 
geegtacaat 1200 
aatattccag 1260 
ttaatggcca 1320 
ctcacaccac 1380 
aagaaaaaag 1440 
gtatgttgtg 1500 
tctcaagatc 1560 
gecagttact 1620 
atccagaaca 1680 
aaaacageca 1740 
ttttttgect 1800 
agaatcaata 1860 
attccagcac 1920 
ctggctaaca 1980 
ggcgggtgcc 2040 
ggaggtggag 2100 
aagactctgt 2160 
2184 



<210> 297 

<211> 1855 

<212> DNA 

<213> Homo sapiens 
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<220> 

<221> misc_feature 
<222> 606 

<223> n = A,T,C or G 
<400> 297 

tgcacgcatc ggccagtgtc tgtgccacgt acactgacgc cccctgagat gtgcacgccg 60 
cacgcgcacg ttgcacgcgc ggcagcggct tggctggctt gtaacggctt gcacgcgcac 120 
gccgcccccg cataaccgtc agactggcct gtaacggctt gcaggcgcac gccgcacgcg 180 
cgtaacggct tggctgccct gtaacggctt gcacgtgcat gctgcacgcg cgttaacggc 240 
ttggctggca tgtagccgct tggcttggct ttgcattytt tgctkggctk ggcgttgkty 300 
tcttggattg acgcttcctc cttggatkga cgtttcctcc ttggatkgac gtttcytyty 360 
tcgcgttcct ttgctggact tgacctttty tctgctgggt ttggcattcc tttggggtgg 420 
gctgggtgtt ttctccgggg gggktkgccc ttcctggggt gggcgtgggk cgcccccagg 480 
gggcgtgggc tttccccggg tgggtgtggg ttttcctggg gtggggtggg ctgtgctggg 540 
atccccctgc tggggttggc agggattgac ttttttcttc aaacagattg gaaacccgga 600 
gtaacntgct agttggtgaa actggttggt agacgcgatc tgctggtact actgtttctc 660 
ctggctgtta aaagcagatg gtggctgagg ttgattcaat gccggctgct tcttctgtga 720 
agaagccatt tggtctcagg agcaagatgg gcaagtggtg cgccactgct tcccctgctg 780 
cagggggagc ggcaagagca acgtgggcac ttctggagac cacaacgact cctctgtgaa 840 
gacgcttggg agcaagaggt gcaagtggtg ctgcccactg cttcccctgc tgcaggggag 900 
cggcaagagc aacgtggkcg cttggggaga ctacgatgac agcgccttca tggakcccag 960 
gtaccacgtc crtggagaag atctggacaa gctccacaga gctgcctggt ggggtaaagt 1020 
ccccagaaag gatctcatcg tcatgctcag ggacactgay gtgaacaaga rggacaagca 1080 
aaagaggact gctctacatc tggcctctgc caatgggaat tcagaagtag taaaactcgt 1140 
gctggacaga cgatgtcaac ttaatgtcct tgacaacaaa aagaggacag ctctgacaaa 1200 
ggccgtacaa tgccaggaag atgaatgtgc gttaatgttg ctggaacatg gcactgatcc 1260 
aaatattcca gatgagtatg gaaataccac tctacactat gctgtctaca atgaagataa 1320 
attaatggcc aaagcactgc tcttatacgg tgctgatatc gaatcaaaaa acaaggtata 1380 
gatctactaa ttttatcttc aaaatactga aatgcattca ttttaacatt gacgtgtgta 1440 
agggccagtc ttccgtattt ggaagctcaa gcataacttg aatgaaaata ttttgaaatg 1500 
acctaattat ctaagacttt attttaaata ttgttatttt caaagaagca ttagagggta 1560 
cagttttttt tttttaaatg cacttctggt aaatactttt gttgaaaaca ctgaatttgt 1620 
aaaaggtaat acttactatt tttcaatttt tccctcctag gatttttttc ccctaatgaa 1680 
tgtaagatgg caaaatttgc cctgaaatag gttttacatg aaaactccaa gaaaagttaa 1740 
acatgtttca gtgaatagag atcctgctcc tttggcaagt tcctaaaaaa cagtaataga 1800 
tacgaggtga tgcgcctgtc agtggcaagg tttaagatat ttctgatctc gtgcc 1855 

<210> 298 

<211> 1059 

<212> DNA 

<213> Homo sapiens 

<400> 298 

gcaacgtggg cacttctgga gaccacaacg actcctctgt gaagacgctt gggagcaaga 60 
ggtgcaagtg gtgctgccca ctgcttcccc tgctgcaggg gagcggcaag agcaacgtgg 120 
gcgcttgrgg agactmcgat gacagygcct tcatggagcc caggtaccac gtccgtggag 180 
aagatctgga caagctccac agagctgccc tggtggggta aagtccccag aaaggatctc 240 
atcgtcatgc tcagggacac tgaygtgaac aagarggaca agcaaaagag gactgctcta 300 
catctggcct ctgccaatgg gaattcagaa gtagtaaaac tcstgctgga cagacgatgt 360 
caacttaatg tccttgacaa caaaaagagg acagctctga yaaaggccgt acaatgccag 420 
gaagatgaat gtgcgttaat gttgctggaa catggcactg atccaaatat tccagatgag 480 
tatggaaata ccactctrca ctaygctrtc tayaatgaag ataaattaat ggccaaagca 540 
ctgctcttat ayggtgctga tatcgaatca aaaaacaagg tatagatcta ctaattttat 600 
cttcaaaata ctgaaatgca ttcattttaa cattgacgtg tgtaagggcc agtcttccgt 660 
atttggaagc tcaagcataa cttgaatgaa aatattttga aatgacctaa ttatctaaga 720 
ctttatttta aatattgtta ttttcaaaga agcattagag ggtacagttt ttttttttta 780 
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aatgcacttc tggtaaatac ttttgttgaa aacactgaat ttgtaaaagg taatacttac 840 
tatttttcaa tttttccctc ctaggatttt tttcccctaa tgaatgtaag atggcaaaat 900 
ttgccctgaa ataggtttta catgaaaact ccaagaaaag ttaaacatgt ttcagtgaat 960 
agagatcctg ctcctttggc aagttcctaa aaaacagtaa tagatacgag gtgatgcgcc 1020 
tgtcagtggc aaggtttaag atatttctga tctcgtgcc 1059 

<210> 299 

<211> 329 

<212> PRT 

<213> Homo sapiens 

<400> 299 

Met Asp lie Val Val Ser Gly Ser His Pro Leu Trp Val Asp Ser Phe 

15 10 15 

Leu His Leu Ala Gly Ser Asp Leu Leu Ser Arg Ser Leu Met Ala Glu 

20 25 30 

Glu Tyr Thr He Val His Ala Ser Phe He Ser Cys He Ser Ser Ser 

35 40 45 

Leu Asp Gly Gin Gly Glu Arg Gin Glu Gin Arg Gly His Phe Trp Arg 

50 55 60 

Pro Gin Arg Leu Leu Cys Glu Asp Ala Trp Glu Gin Glu Val Gin Val 
65 70 75 80 

Val Leu Pro Leu Leu Pro Leu Leu Gin Gly Ser Gly Lys Ser Asn Val 

85 90 95 

Val Ala Trp Gly Asp Tyr Asp Asp Ser Ala Phe Met Asp Pro Arg Tyr 

100 105 HO 

His Val His Gly Glu Asp Leu Asp Lys Leu His Arg Ala Ala Trp Trp 

115 120 125 

Gly Lys Val Pro Arg Lys Asp Leu He Val Met Leu Arg Asp Thr Asp 

130 135 140 

Val Asn Lys Arg Asp Lys Gin Lys Arg Thr Ala Leu His Leu Ala Ser 
145 150 155 160 

Ala Asn Gly Asn Ser Glu Val Val Lys Leu Val Leu Asp Arg Arg Cys 

165 170 175 

Gin Leu Asn Val Leu Asp Asn Lys Lys Arg Thr Ala Leu Thr Lys Ala 

180 185 190 

Val Gin Cys Gin Glu Asp Glu Cys Ala Leu Met Leu Leu Glu His Gly 

195 200 205 

Thr Asp Pro Asn He Pro Asp Glu Tyr Gly Asn Thr Thr Leu His Tyr 

210 215 220 

Ala Val Tyr Asn Glu Asp Lys Leu Met Ala Lys Ala Leu Leu Leu Tyr 
225 230 235 240 

Gly Ala Asp He Glu Ser Lys Asn Lys His Gly Leu Thr Pro Leu Leu 

245 250 255 

Leu Gly He His Glu Gin Lys Gin Gin Val Val Lys Phe Leu He Lys 

260 265 270 

Lys Lys Ala Asn Leu Asn Ala Leu Asp Arg Tyr Gly Arg Thr Ala Leu 

275 280 285 

He Leu Ala Val Cys Cys Gly Ser Ala Ser He Val Ser Pro Leu Leu 

290 295 300 

Glu Gin Asn Val Asp Val Ser Ser Gin Asp Leu Glu Arg Arg Pro Glu 
305 310 315 320 

Ser Met Leu Phe Leu Val He He Met 
325 



<210> 300 
<211> 148 
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<212> PRT 

<213> Homo sapiens 

<220> 

<221> VARIANT 

<222> 3, 46, 69, 88, 124 

<223> Xaa = Any Amino Acid 

<400> 300 

Met Thr Xaa Pro Ser Trp Ser Pro Gly Thr Thr Ser Val Giu Lys He 

15 10 15 

Trp Thr Ser Ser Thr Glu Leu Pro Trp Trp Gly Lys Val Pro Arg Lys 

20 25 30 

Asp Leu He Val Met Leu Arg Asp Thr Asp Val Asn Lys Xaa Asp Lys 

35 40 45 

Gin Lys Arg Thr Ala Leu His Leu Ala Ser Ala Asn Gly Asn Ser Glu 

50 55 60 

Val Val Lys Leu Xaa Leu Asp Arg Arg Cys Gin Leu Asn Val Leu Asp 
65 70 75 80 

Asn Lys Lys Arg Thr Ala Leu Xaa Lys Ala Val Gin Cys Gin Glu Asp 

85 90 95 

Glu Cys Ala Leu Met Leu Leu Glu His Gly Thr Asp Pro Asn He Pro 

100 105 110 

Asp Glu Tyr Gly Asn Thr Thr Leu His Tyr Ala Xaa Tyr Asn Glu Asp 

115 120 125 

Lys Leu Met Ala Lys Ala Leu Leu Leu Tyr Gly Ala Asp He Glu Ser 

130 135 140 

Lys Asn Lys Val 
145 



<210> 301 

<211> 1155 

<212> DNA 

<213> Homo sapiens 

<400> 301 

atggtggttg aggttgattc catgccggct gcctcttctg tgaagaagcc atttggtctc 60 
aggagcaaga tgggcaagtg gtgctgccgt tgcttcccct gctgcaggga gagcggcaag 120 
agcaacgtgg gcacttctgg agaccacgac gactctgcta tgaagacact caggagcaag 180 
atgggcaagt ggtgccgcca ctgcttcccc tgctgcaggg ggagtggcaa gagcaacgtg 240 
ggcgcttctg gagaccacga cgactctgct atgaagacac tcaggaacaa gatgggcaag 300 
tggtgctgcc actgcttccc ctgctgcagg gggagcggca agagcaaggt gggcgcttgg 360 
ggagactacg atgacagtgc cttcatggag cccaggtacc acgtccgtgg agaagatctg 420 
gacaagctcc acagagctgc ctggtggggt aaagtcccca gaaaggatct catcgtcatg 480 
ctcagggaca ctgacgtgaa caagaaggac aagcaaaaga ggactgctct acatctggcc 540 
tctgccaatg ggaattcaga agtagtaaaa ctcctgctgg acagacgatg tcaacttaat 600 
gtccttgaca acaaaaagag gacagctctg ataaaggccg tacaatgcca ggaagatgaa 660 
tgtgcgttaa tgttgctgga acatggcact gatccaaata ttccagatga gtatggaaat 720 
accactctgc actacgctat ctataatgaa gataaattaa tggccaaagc actgctctta 780 
tatggtgctg atatcgaatc aaaaaacaag catggcctca caccactgtt acttggtgta 840 
catgagcaaa aacagcaagt cgtgaaattt ttaatcaaga aaaaagcgaa tttaaatgca 900 
ctggatagat atggaaggac tgctctcata cttgctgtat gttgtggatc agcaagtata 960 
gtcagccttc tacttgagca aaatattgat gtatcttctc aagatctatc tggacagacg 1020 
gccagagagt atgctgtttc tagtcatcat catgtaattt gccagttact ttctgactac 1080 
aaagaaaaac agatgctaaa aatctcttct gaaaacagca atccagaaaa tgtctcaaga 1140 
accagaaata aataa 1155 
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<210> 302 

<211> 2000 

<212> DNA 

<213> Homo sapiens 



<400> 302 
atggtggttg 
aggagcaaga 
agcaacgtgg 
atgggcaagt 
ggcgcttctg 
tggtgctgcc 
ggagactacg 
gacaagctcc 
ctcagggaca 
tctgccaatg 
gtccttgaca 
tgtgcgttaa 
accactctgc 
tatggtgctg 
catgagcaaa 
ctggatagat 
gtcagccttc 
gccagagagt 
aaagaaaaac 
ctgacatcag 
atgtctcaag 
aagcatgaaa 
aatggtgata 
cctgacaacg 
aaacagatgc 
tcagaggaag 
tttatggcta 
ctgactaatg 
agaacacctg 
caaaatgata 
attctgattc 
cttagttgta 
gccatgctaa 
aaaaaaaaaa 



aggttgattc 
tgggcaagtg 
gcacttctgg 
ggtgccgcca 
gagaccacga 
actgcttccc 
atgacagtgc 
acagagctgc 
ctgacgtgaa 
ggaattcaga 
acaaaaagag 
tgttgctgga 
actacgctat 
atatcgaatc 
aacagcaagt 
atggaaggac 
tacttgagca 
atgctgtttc 
agatgctaaa 
aggaagagtc 
aaccagaaat 
gtaataatgt 
atggattaat 
aaagtgaaga 
caaaatactc 



agtcacaaag 
tcgaagaaat 
gtgccactgc 
aaagccagca 
ctcagaagca 
atgaagaaaa 
agaaagaaaa 
gactggagct 
aaaaaaaaaa 



catgccggct 
gtgctgccgt 
agaccacgac 
ctgcttcccc 
cgactctgct 
ctgctgcagg 
cttcatggag 
ctggtggggt 
caagaaggac 
agtagtaaaa 
gacagctctg 
acatggcact 
ctataatgaa 
aaaaaacaag 
cgtgaaattt 
tgctctcata 
aaatattgat 
tagtcatcat 
aatctcttct 
acaaaggttc 
aaataaggat 
gggattacta 
tcctcaaagg 
gtatcacaga 
ttctgaaaac 
gcttgagggc 
gaagaagcac 
tggcaatggt 
atttcctgac 
attttgtgaa 
gcagatagaa 
agacatcttg 
agacacaatg 



gcctcttctg 
tgcttcccct 
gactctgcta 
tgctgcaggg 
atgaagacac 
gggagcggca 
cccaggtacc 
aaagtcccca 
aagcaaaaga 
ctcctgctgg 
ataaaggccg 
gatccaaata 
gataaattaa 
catggcctca 
ttaatcaaga 
cttgctgtat 
gtatcttctc 
catgtaattt 
gaaaacagca 
aaaggcagtg 
ggtgatagag 
gaaaacctga 
aagagcagaa 
atttgcgaat 
agcaacccag 
agtgaaaatg 
ggaagtactc 
gatgatggat 
actgagaatg 
gaacagaaca 
gtggttgaaa 
catgaaaata 
aaacatcaga 



tgaagaagcc 
gctgcaggga 
tgaagacact 
ggagtggcaa 
tcaggaacaa 
agagcaaggt 
acgtccgtgg 
gaaaggatct 
ggactgctct 
acagacgatg 
tacaatgcca 
ttccagatga 
tggccaaagc 
caccactgtt 
aaaaagcgaa 
gttgtggatc 
aagatctatc 
gccagttact 
atccagaaca 
aaaatagcca 
aggttgaaga 
ctaatggtgt 
cacctgaaaa 
tagtttctga 
aacaagactt 
gccagccaga 
atgtcggatt 
taattcctcc 
aagagtatca 
ctggaatatt 
aaatgaattc 
gtacgttgcg 
gccagctaaa 



atttggtctc 
gagcggcaag 
caggagcaag 
gagcaacgtg 
gatgggcaag 
gggcgcttgg 
agaagatctg 
catcgtcatg 
acatctggcc 
tcaacttaat 
ggaagatgaa 
gtatggaaat 
actgctctta 
acttggtgta 
tttaaatgca 
agcaagtata 
tggacagacg 
ttctgactac 
agacttaaag 
gccagagaaa 
agaaatgaag 
cactgctggc 
tcagcaattt 
ctacaaagaa 
aaagctgaca 
gctagaaaat 
cccagaaaac 
aaggaagagc 
cagtgacgaa 
acacgatgag 
tgagctttct 
ggaagaaatt 
aaaaaaaaaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2000 



<210> 303 
<211> 2040 
<212> DNA 

<213> Homo sapiens 



<400> 303 

atggtggttg 

aggagcaaga 

agcaacgtgg 

atgggcaagt 

ggcgcttctg 

tggtgctgcc 

ggagactacg 

gacaagctcc 

ctcagggaca 

tctgccaatg 

gtccttgaca 



aggttgattc 
tgggcaagtg 
gcacttctgg 
ggtgccgcca 
gagaccacga 
actgcttccc 
atgacagtgc 
acagagctgc 
ctgacgtgaa 
ggaattcaga 
acaaaaagag 



catgccggct 
gtgctgccgt 
agaccacgac 
ctgcttcccc 
cgactctgct 
ctgctgcagg 
cttcatggag 
ctggtggggt 
caagaaggac 
agtagtaaaa 
gacagctctg 



gcctcttctg 
tgcttcccct 
gactctgcta 
tgctgcaggg 
atgaagacac 
gggagcggca 
cccaggtacc 
aaagtcccca 
aagcaaaaga 
ctcctgctgg 
ataaaggccg 



tgaagaagcc 
gctgcaggga 
tgaagacact 
ggagtggcaa 
tcaggaacaa 
agagcaaggt 
acgtccgtgg 
gaaaggatct 
ggactgctct 
acagacgatg 
tacaatgcca 



atttggtctc 60 
gagcggcaag 120 
caggagcaag 180 
gagcaacgtg 240 
gatgggcaag 300 
gggcgcttgg 360 
agaagatctg 420 
catcgtcatg 4 80 
acatctggcc 540 
tcaacttaat 600 
ggaagatgaa 660 
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tgtgcgttaa tgttgctgga acatggcact gatccaaata ttccagatga gtatggaaat 720 
accactctgc actacgctat ctataatgaa gataaattaa tggccaaagc actgctctta 780 
tatggtgctg atatcgaatc aaaaaacaag catggcctca caccactgtt acttggtgta 840 
catgagcaaa aacagcaagt cgtgaaattt ttaatcaaga aaaaagcgaa tttaaatgca 900 
ctggatagat atggaaggac tgctctcata cttgctgtat gttgtggatc agcaagtata 960 
gtcagccttc tacttgagca aaatattgat gtatcttctc aagatctatc tggacagacg 1020 
gccagagagt atgctgtttc tagtcatcat catgtaattt gccagttact ttctgactac 1080 
aaagaaaaac agatgctaaa aatctcttct gaaaacagca atccagaaca agacttaaag 1140 
ctgacatcag aggaagagtc acaaaggttc aaaggcagtg aaaatagcca gccagagaaa 1200 
atgtctcaag aaccagaaat aaataaggat ggtgatagag aggttgaaga agaaatgaag 1260 
aagcatgaaa gtaataatgt gggattacta gaaaacctga ctaatggtgt cactgctggc 1320 
aatggtgata atggattaat tcctcaaagg aagagcagaa cacctgaaaa tcagcaattt 1380 
cctgacaacg aaagtgaaga gtatcacaga atttgcgaat tagtttctga ctacaaagaa 1440 
aaacagatgc caaaatactc ttctgaaaac agcaacccag aacaagactt aaagctgaca 1500 
tcagaggaag agtcacaaag gcttgagggc agtgaaaatg gccagccaga gaaaagatct 1560 
caagaaccag aaataaataa ggatggtgat agagagctag aaaattttat ggctatcgaa 1620 
gaaatgaaga agcacggaag tactcatgtc ggattcccag aaaacctgac taatggtgcc 1680 
actgctggca atggtgatga tggattaatt cctccaagga agagcagaac acctgaaagc 1740 
cagcaatttc ctgacactga gaatgaagag tatcacagtg acgaacaaaa tgatactcag 1800 
aagcaatttt gtgaagaaca gaacactgga atattacacg atgagattct gattcatgaa 1860 
gaaaagcaga tagaagtggt tgaaaaaatg aattctgagc tttctcttag ttgtaagaaa 1920 
gaaaaagaca tcttgcatga aaatagtacg ttgcgggaag aaattgccat gctaagactg 1980 
gagctagaca caatgaaaca tcagagccag ctaaaaaaaa aaaaaaaaaa aaaaaaaaaa 2040 



<210> 304 
<211> 384 
<212> PRT 
<213> Homo sapiens 



<400> 304 



Met 


Val 


Val 


Glu 


Val 


Asp 


Ser 


Met 


Pro 


Ala 


Ala Ser 


Ser 


Val 


Lys 


Lys 


1 








5 










10 








15 




Pro 


Phe 


Gly 


Leu 


Arg 


Ser 


Lys 


Met 


Gly 


Lys 


Trp Cys 


Cys 


Arg 


Cys 


Phe 






20 










25 








30 






Pro 


Cys 


Cys 


Arg 


Glu 


Ser 


Gly 


Lys 


Ser 


Asn 


Val Gly 


Thr 


Ser 


Gly 


Asp 




35 










40 








45 








His 


Asp 
50 


Asp 


Ser 


Ala 


Met 


Lys 
55 


Thr 


Leu 


Arg 


Ser Lys 
60 


Met 


Gly 


Lys 


Trp 


Cys 


Arg 


His 


Cys 


Phe 


Pro 


Cys 


Cys 


Arg 


Gly 


Ser Gly 


Lys 


Ser 


Asn 


Val 


65 










70 










75 








80 


Gly 


Ala 


Ser 


Gly 


Asp 


His 


Asp 


Asp 


Ser 


Ala 


Met Lys 


Thr 


Leu 


Arg 


Asn 








85 










90 








95 




Lys 


Met 


Gly 


Lys 


Trp 


Cys 


Cys 


His 


Cys 


Phe 


Pro Cys 


Cys 


Arg 


Gly 


Ser 




100 










105 








110 






Gly 


Lys 


Ser 


Lys 


Val 


Gly 


Ala 


Trp 


Gly 


Asp 


Tyr Asp 


Asp 


Ser 


Ala 


Phe 


115 










120 








125 






His 


Met 


Glu 


Pro 


Arg 


Tyr 


His 


Val 


Arg 


Gly 


Glu 


Asp Leu 


Asp 


Lys 


Leu 




130 






135 








140 










Arg 


Ala 


Ala 


Trp 


Trp 


Gly 


Lys 


Val 


Pro 


Arg 


Lys Asp 


Leu 


lie 


Val 


Met 


145 










150 










155 








160 


Leu 


Arg 


Asp 


Thr 


Asp 


Val 


Asn 


Lys 


Lys 


Asp 


Lys Gin 


Lys 


Arg 


Thr 


Ala 








165 










170 








175 




Leu 


His 


Leu 


Ala 


Ser 


Ala 


Asn 


Gly 


Asn 


Ser 


Glu Val 


Val 


Lys 


Leu 


Leu 








180 








185 








190 




Thr 


Leu 


Asp 


Arg 


Arg 


Cys 


Gin 


Leu 


Asn 


Val 


Leu 


Asp Asn 


Lys 


Lys 


Arg 




195 








200 








205 








Ala 


Leu 


lie 


Lys 


Ala 


Val 


Gin 


Cys 


Gin 


Glu 


Asp Glu 


Cys 


Ala 


Leu 


Met 
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210 215 220 

Leu Leu Glu His Gly Thr Asp Pro Asn He Pro Asp Glu Tyr Gly Asn 
225 230 235 240 

Thr Thr Leu His Tyr Ala He Tyr Asn Glu Asp Lys Leu Met Ala Lys 

245 250 255 

Ala Leu Leu Leu Tyr Gly Ala Asp He Glu Ser Lys Asn Lys His Gly 

260 265 270 

Leu Thr Pro Leu Leu Leu Gly Val His Glu Gin Lys Gin Gin Val Val 

275 280 285 

Lys Phe Leu He Lys Lys Lys Ala Asn Leu Asn Ala Leu Asp Arg Tyr 

290 295 300 

Gly Arg Thr Ala Leu He Leu Ala Val Cys Cys Gly Ser Ala Ser He 
305 310 315 320 

Val Ser Leu Leu Leu Glu Gin Asn He Asp Val Ser Ser Gin Asp Leu 

325 330 335 

Ser Gly Gin Thr Ala Arg Glu Tyr Ala Val Ser Ser His His His Val 

340 345 350 

He Cys Gin Leu Leu Ser Asp Tyr Lys Glu Lys Gin Met Leu Lys He 

355 360 365 

Ser Ser Glu Asn Ser Asn Pro Glu Asn Val Ser Arg Thr Arg Asn Lys 
370 375 380 



<210> 305 
<211> 656 
<212> PRT 
<213> Homo sapiens 

<400> 305 

Met Val Val Glu Val Asp Ser Met Pro Ala Ala Ser Ser Val Lys Lys 

15 10 15 

Pro Phe Gly Leu Arg Ser Lys Met Gly Lys Trp Cys Cys Arg Cys Phe 

20 25 30 

Pro Cys Cys Arg Glu Ser Gly Lys Ser Asn Val Gly Thr Ser Gly Asp 

35 40 45 

His Asp Asp Ser Ala Met Lys Thr Leu Arg Ser Lys Met Gly Lys Trp 

50 55 60 

Cys Arg His Cys Phe Pro Cys Cys Arg Gly Ser Gly Lys Ser Asn Val 
65 70 75 80 

Gly Ala Ser Gly Asp His Asp Asp Ser Ala Met Lys Thr Leu Arg Asn 

85 90 95 

Lys Met Gly Lys Trp Cys Cys His Cys Phe Pro Cys Cys Arg Gly Ser 

100 105 HO 

Gly Lys Ser Lys Val Gly Ala Trp Gly Asp Tyr Asp Asp Ser Ala Phe 

115 120 125 

Met Glu Pro Arg Tyr His Val Arg Gly Glu Asp Leu Asp Lys Leu His 

130 135 140 

Arg Ala Ala Trp Trp Gly Lys Val Pro Arg Lys Asp Leu He Val Met 
145 150 155 160 

Leu Arg Asp Thr Asp Val Asn Lys Lys Asp Lys Gin Lys Arg Thr Ala 

165 170 175 

Leu His Leu Ala Ser Ala Asn Gly Asn Ser Glu Val Val Lys Leu Leu* 

180 185 190 

Leu Asp Arg Arg Cys Gin Leu Asn Val Leu Asp Asn Lys Lys Arg Thr 

195 200 205 

Ala Leu He Lys Ala Val Gin Cys Gin Glu Asp Glu Cys Ala Leu Met 

210 215 220 

Leu Leu Glu His Gly Thr Asp Pro Asn He Pro Asp Glu Tyr Gly Asn 
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225 



230 



235 



240 



Thr Thr Leu His Tyr Ala lie Tyr Asn Glu Asp Lys Leu Met Ala Lys 

245 250 255 

Ala Leu Leu Leu Tyr Gly Ala Asp lie Glu Ser Lys Asn Lys His Gly 

260 265 270 

Leu Thr Pro Leu Leu Leu Gly Val His Glu Gin Lys Gin Gin Val Val 

275 280 285 

Lys Phe Leu lie Lys Lys Lys Ala Asn Leu Asn Ala Leu Asp Arg Tyr 

290 295 300 

Gly Arg Thr Ala Leu lie Leu Ala Val Cys Cys Gly Ser Ala Ser lie 
305 310 315 320 

Val Ser Leu Leu Leu Glu Gin Asn lie Asp Val Ser Ser Gin Asp Leu 

325 330 335 

Ser Gly Gin Thr Ala Arg Glu Tyr Ala Val Ser Ser His His His Val 

340 345 350 

lie Cys Gin Leu Leu Ser Asp Tyr Lys Glu Lys Gin Met Leu Lys He 

355 360 365 

Ser Ser Glu Asn Ser Asn Pro Glu Gin Asp Leu Lys Leu Thr Ser Glu 

370 375 380 

Glu Glu Ser Gin Arg Phe Lys Gly Ser Glu Asn Ser Gin Pro Glu Lys 
385 390 395 400 

Met Ser Gin Glu Pro Glu He Asn Lys Asp Gly Asp Arg Glu Val Glu 

405 410 415 

Glu Glu Met Lys Lys His Glu Ser Asn Asn Val Gly Leu Leu Glu Asn 

420 425 430 

Leu Thr Asn Gly Val Thr Ala Gly Asn Gly Asp Asn Gly Leu He Pro 

435 440 445 

Gin Arg Lys Ser Arg Thr Pro Glu Asn Gin Gin Phe Pro Asp Asn Glu 

450 455 460 

Ser Glu Glu Tyr His Arg He Cys Glu Leu Val Ser Asp Tyr Lys Glu 
465 470 475 480 

Lys Gin Met Pro Lys Tyr Ser Ser Glu Asn Ser Asn Pro Glu Gin Asp 

485 490 495 

Leu Lys Leu Thr Ser Glu Glu Glu Ser Gin Arg Leu Glu Gly Ser Glu 

500 505 510 

Asn Gly Gin Pro Glu Leu Glu Asn Phe Met Ala He Glu Glu Met Lys 

515 520 525 

Lys His Gly Ser Thr His Val Gly Phe Pro Glu Asn Leu Thr Asn Gly 

530 535 540 

Ala Thr Ala Gly Asn Gly Asp Asp Gly Leu He Pro Pro Arg Lys Ser 
545 550 555 560 

Arg Thr Pro Glu Ser Gin Gin Phe Pro Asp Thr Glu Asn Glu Glu Tyr 

565 570 575 

His Ser Asp Glu Gin Asn Asp Thr Gin Lys Gin Phe Cys Glu Glu Gin 

580 585 590 

Asn Thr Gly He Leu His Asp Glu He Leu He His Glu Glu Lys Gin 

595 600 605 

He Glu Val Val Glu Lys Met Asn Ser Glu Leu Ser Leu Ser Cys Lys 

610 615 620 

Lys Glu Lys Asp He Leu His Glu Asn Ser Thr Leu Arg Glu Glu He 
625 630 635 640 

Ala Met Leu Arg Leu Glu Leu Asp Thr Met Lys His Gin Ser Gin Leu 



645 



650 



655 



<210> 306 
<211> 671 
<212> PRT 
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<213> Homo sapiens 
<400> 306 

Met Val Val Glu Val Asp Ser Met Pro Ala Ala Ser Ser Val Lys Lys 

15 10 15 

Pro Phe Gly Leu Arg Ser Lys Met Gly Lys Trp Cys Cys Arg Cys Phe 

20 25 30 

Pro Cys Cys Arg Glu Ser Gly Lys Ser Asn Val Gly Thr Ser Gly Asp 

35 40 45 

His Asp Asp Ser Ala Met Lys Thr Leu Arg Ser Lys Met Gly Lys Trp 

50 55 60 

Cys Arg His Cys Phe Pro Cys Cys Arg Gly Ser Gly Lys Ser Asn Val 
65 70 75 80 

Gly Ala Ser Gly Asp His Asp Asp Ser Ala Met Lys Thr Leu Arg Asn 

85 90 95 

Lys Met Gly Lys Trp Cys Cys His Cys Phe Pro Cys Cys Arg Gly Ser 

100 105 110 

Gly Lys Ser Lys Val Gly Ala Trp Gly Asp Tyr Asp Asp Ser Ala Phe 

115 120 125 

Met Glu Pro Arg Tyr His Val Arg Gly Glu Asp Leu Asp Lys Leu His 

130 135 140 

Arg Ala Ala Trp Trp Gly Lys Val Pro Arg Lys Asp Leu He Val Met 
145 150 155 160 

Leu Arg Asp Thr Asp Val Asn Lys Lys Asp Lys Gin Lys Arg Thr Ala 

165 170 175 

Leu His Leu Ala Ser Ala Asn Gly Asn Ser Glu Val Val Lys Leu Leu 

180 185 190 

Leu Asp Arg Arg Cys Gin Leu Asn Val Leu Asp Asn Lys Lys Arg Thr 

195 200 205 

Ala Leu He Lys Ala Val Gin Cys Gin Glu Asp Glu Cys Ala Leu Met 

210 215 220 

Leu Leu Glu His Gly Thr Asp Pro Asn He Pro Asp Glu Tyr Gly Asn 
225 230 235 240 

Thr Thr Leu His Tyr Ala He Tyr Asn Glu Asp Lys Leu Met Ala Lys 

245 250 255 

Ala Leu Leu Leu Tyr Gly Ala Asp He Glu Ser Lys Asn Lys His Gly 

260 265 270 

Leu Thr Pro Leu Leu Leu Gly Val His Glu Gin Lys Gin Gin Val Val 

275 280 285 

Lys Phe Leu He Lys Lys Lys Ala Asn Leu Asn Ala Leu Asp Arg Tyr 

290 295 300 

Gly Arg Thr Ala Leu He Leu Ala Val Cys Cys Gly Ser Ala Ser He 
305 310 315 320 

Val Ser Leu Leu Leu Glu Gin Asn He Asp Val Ser Ser Gin Asp Leu 

325 330 335 

Ser Gly Gin Thr Ala Arg Glu Tyr Ala Val Ser Ser His His His Val 

340 345 350 

He Cys Gin Leu Leu Ser Asp Tyr Lys Glu Lys Gin Met Leu Lys He 

355 360 365 

Ser Ser Glu Asn Ser Asn Pro Glu Gin Asp Leu Lys Leu Thr Ser Glu 

370 375 380 

Glu Glu Ser Gin Arg Phe Lys Gly Ser Glu Asn Ser Gin Pro Glu Lys 
385 390 395 400 

Met Ser Gin Glu Pro Glu He Asn Lys Asp Gly Asp Arg Glu Val Glu 

405 410 415 

Glu Glu Met Lys Lys His Glu Ser Asn Asn Val Gly Leu Leu Glu Asn 

420 425 430 

Leu Thr Asn Gly Val Thr Ala Gly Asn Gly Asp Asn Gly Leu He Pro 
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4.55 


AA(\ 








fif(3 








Gin 


Arg 


Lys Ser Arg Thr Pro 


G1U 


Asn 


Gin 




File rlu 


Asp 


Asn 


IjlU 




450 


A C C 

455 










40U 








Ser 


Glu 


Glu Tyr His Arg He 


Cys 


Glu 


Leu 


vai 


Ser Asp 


Tyr 


Lys 


blU 


465 




Am 
4 /□ 








JllC 

*i f D 








4QU 


Lys 


Gin 


Met Pro Lys Tyr Ser 


Ser 


Glu 


Asn 


ser 


Asn Pro 


Vj-LU 




Asp 






485 






490 








A Q C 

495 




Leu 


Lys 


Leu Thr Ser Glu Glu 


Glu 


Ser 


Gin 


Arg 


Leu Glu 


Gly 


Ser 


GlU 






500 




505 








CIA 

510 






Asn 


Gly 


Gin Pro Glu Lys Arg 


Ser 


Gin 


Glu 


Pro 


Glu He 


Asn 


Lys 


Asp 






515 


520 








5^5 








Gly 


Asp 


Arg Glu Leu Glu Asn 


Phe 


Met 


Ala 


He 


GlU GlU 


Met 


Lys 


Lys 




530 


535 










540 








His 


Gly 


Ser Thr His Val Gly 


Phe 


Pro 


Glu 


Asn 


Leu Thr 


Asn 


Gly 


Ala 


545 




550 








555 








5oU 


Thr 


Ala 


Gly Asn Gly Asp Asp 


Gly Leu 


He 


Pro 


Pro Arg 


Lys 


Ser 


Arg 






565 






570 








C *7 C 

5/5 




Thr 


Pro 


Glu Ser Gin Gin Phe 


Pro Asp 


Thr 


Glu 


Asn Glu 


Glu 


Tyr 


His 






580 




585 








590 






Ser 


Asp 


Glu Gin Asn Asp Thr 


Gin Lys 


Gin 


Phe 


Cys Glu 


Glu 


Gin 


Asn 






595 


600 








605 








Thr 


Gly 


He Leu His Asp Glu 


He 


Leu 


He 


His 


Glu Glu 


Lys 


Gin 


He 




610 


615 










620 








Glu 


Val 


Val Glu Lys Met Asn 


Ser 


Glu 


Leu 


Ser 


Leu Ser 


Cys 


Lys 


Lys 


625 




630 








635 








640 


Glu 


Lys 


Asp He Leu His Glu 


Asn 


Ser 


Thr 


Leu Arg Glu 


Glu 


He 


Ala 






645 






650 








655 




Met 


Leu 


Arg Leu Glu Leu Asp 


Thr 


Met 


Lys 


His 


Gin Ser 


Gin 


Leu 








660 




665 








670 







<210> 307 
<211> 800 
<212> DNA 
<213> Homo sapiens 



<400> 307 

atkagcttcc 

acttcatttt 

agaatgctta 

tttttcttgg 

catcagtaag 

ctagtgtttc 

cactgcaggt 

ctttcagatg 

atgggacaaa 

ggcttggccc 

ttacaatact 

accttatgtc 

acatcccaca 

tcctattagt 



gcttctgaca 
tggtacataa 
ggactctaac 
tcctccttgt 
ggccactaaa 
tgttgctgtg 
tcttgggcag 
ggaaacactc 
tttgacccac 
caacattctc 
atcctgcagc 
caagctttct 
ggaggacctc 
gataagcctc 



acactagaga 
catctttata 
aggtttttga 
ggtctaggag 
tccgaccttc 
tcagtgagca 
ggggagaaac 
aggcatcaac 
aaaccctgga 
tctctgatgg 
ttgacctttt 
tttcattgaa 
tcagcttacc 



tccctcccct 
ggacaggggt 
gaatgtgttg 
gacaggcaag 
ctcgttcctc 
caactattcc 
aaaacaaacc 
aggctcacct 
aaaagaggtg 
ggaaaaatgg 
ctgtaagagg 
ggagaataca 
cccatatcct 



ccctcagggt 
aaaatcccaa 
gtaagggcca 
ggtgcagatt 
cttgtggtct 
gat cage agg 
aaaaccatgg 
ttgaaatgea 
gctcattttt 
ccacctgagg 
gaaggcaaat 
etatgeaaag 
agcctcccta 



atggccctcc 60 
tactaacagg 120 
ctcaatccaa 180 
ttcaagaatg 240 
gggaggaaaa 300 
gtccagggac 360 
gcrgttttgt 420 
tcctaagcca 480 
tttgeactat 540 
gaagtacaga 600 
ggagtgaaat 660 
cttgaaattt 720 
tagctcccct 780 
800 



<210> 308 
<211> 102 
<212> PRT 
<213> Homo sapiens 



<220> 
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<221> VARIANT 
<222> 3 

<223> Xaa = Any Amino Acid 
<400> 308 

Met Gly Xaa Phe Val Phe Gin Met Gly Asn Thr Gin Ala Ser Thr Gly 

15 10 15 

Ser Pro Leu Lys Cys He Leu Ser Gin Trp Asp Lys Phe Asp Pro Gin 

20 25 30 

Thr Leu Glu Lys Glu Val Ala His Phe Phe Cys Thr Met Ala Trp Pro 

35 40 45 

Gin His Ser Leu Ser Asp Gly Glu Lys Trp Pro Pro Glu Gly Ser Thr 

50 55 60 

Asp Tyr Asn Thr He Leu Gin Leu Asp Leu Phe Cys Lys Arg Glu Gly 
65 70 75 80 

Lys Trp Ser Glu He Pro Tyr Val Gin Ala Phe Phe Ser Leu Lys Glu 

85 90 95 

Asn Thr Leu Cys Lys Ala 
100 



<210> 309 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Made in the lab 
<400> 309 

Leu Met Ala Glu Glu Tyr Thr He Val 
1 5 



<210> 310 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Made in the lab 
<400> 310 

Lys Leu Met Ala Lys Ala Leu Leu Leu 
1 5 



<210> 311 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Made in the lab 
<400> 311 

Gly Leu Thr Pro Leu Leu Leu Gly He 
1 5 
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<210> 312 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Made in the lab 
<400> 312 

Lys Leu Val Leu Asp Arg Arg Cys Gin Leu 
15 10 



<210> 313 

<211> 1852 

<212> DNA 

<213> Homo sapiens 

<400> 313 

ggcacgagaa ttaaaaccct cagcaaaaca ggcatagaag ggacatacct 
aaaaccacct atgacaagcc cacagccaac ataatactaa atggggaaaa 
tttcctctga gaactgcaac aataaataca aggatgctgg attttgtcaa 
tgtgtctgtt gagatgctta tgtgactttg cttttaattc tgtttatgtg 
ttattgactt gcctgtgtta gaccggaaga gctggggtgt ttctcaggag 
ctgcggcagc ttcgggataa cttgaggctg catcactggg gaagaaacac 
gtggcgctga tggctgagga cagagcttca gtgtggcttc tctgcgactg 
ggagttcrttc cttcatagtt catccatatg gctccagagg aaaattatat 
tggatgaaga gtattacgtt gtgcagatat actgcagtgt cttcatctct 
ttgggtaggt tccaccatgt tgccgcagat gacatgattt cagtacctgt 
aaagtgtttg tttgtgaatg gatattgtgg tttctggatc tcatcctctg 
gctttctcca ccttgctgga agtgacctgc tgtccagaag tttgatggct 
ccatcgtgca tgcatctttc atttcctgca tttcttcctc cctggatgga 
ggcaagagca acgtgggcac ttctggagac cacaacgact cctctgtgaa 
agcaagaggt gcaagtggtg ctgccactgc ttcccctgct gcagggggag 
aacgtggtcg cttggggaga ctacgatgac agcgccttca tggatcccag 
catggagaag atctggacaa gctccacaga gctgcctggt ggggtaaagt 
gatctcatcg tcatgctcag ggacacggat gtgaacaaga gggacaagca 
gctctacatc tggcctctgc caatgggaat tcagaagtag taaaactcgt 
cgatgtcaac ttaatgtcct tgacaacaaa aagaggacag ctctgacaaa 
tgccaggaag atgaatgtgc gttaatgttg ctggaacatg gcactgatcc 
gatgagtatg gaaataccac tctacactat gctgtctaca atgaagataa 
aaagcactgc tcttatacgg tgctgatatc gaatcaaaaa acaagcatgg 
ctgctacttg gtatacatga gcaaaaacag caagtggtga aatttttaat 
gcgaatttaa atgcgctgga tagatatgga agaactgctc tcatacttgc 
ggatcagcaa gtatagtcag ccctctactt gagcaaaatg ttgatgtatc 
ctggaaagac ggccagagag tatgctgttt ctagtcatca tcatgtaatt 
tttctgacta caaagaaaaa cagatgttaa aaatctcttc tgaaaacagc 
aagacttaaa gctgacatca gaggaagagt cacaaaggct taaaggaagt 
agccagagct agaagattta tggctattga agaagaatga agaacacgga 
tgggattccc agaaaacctg actaacggtg ccgctgctgg caatggtgat 



<210> 314 
<211> 879 
<212> DNA 
<213> Homo sapiens 



taaagtaata 60 
gttagaagca 120 
atgccttttc 180 
attatcacat 240 
ccaccgtgtg 300 
aytcctgtcc 360 
gcttcttcgg 420 
tattttgtta 480 
tgatgtgtga 540 
gtctggctga 600 
tgggtggaca 660 
gaggagtata 720 
cagggggagc 780 
gacgcttggg 840 
cggcaagagc 900 
gtaccacgtc 960 
ccccagaaag 1020 
aaagaggact 1080 
gctggacaga 1140 
ggccgtacaa 1200 
aaatattcca 1260 
attaatggcc 1320 
cctcacacca 1380 
caagaaaaaa 1440 
tgtatgttgt 1500 
ttctcaagat 1560 
tgccagttac 1620 
aatccagaac 1680 
gaaaacagcc 1740 
agtactcatg 1800 
ga 1852 
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<400> 314 

atgcatcttt catttcctgc atttcttcct ccctggatgg acagggggag cggcaagagc 60 
aacgtgggca cttctggaga ccacaacgac tcctctgtga agacgcttgg gagcaagagg 120 
tgcaagtggt gctgccactg cttcccctgc tgcaggggga gcggcaagag caacgtggtc 180 
gcttggggag actacgatga cagcgccttc atggatccca ggtaccacgt ccatggagaa 240 
gatctggaca agctccacag agctgcctgg tggggtaaag tccccagaaa ggatctcatc 300 
gtcatgctca gggacacgga tgtgaacaag agggacaagc aaaagaggac tgctctacat 360 
ctggcctctg ccaatgggaa ttcagaagta gtaaaactcg tgctggacag acgatgtcaa 420 
cttaatgtcc ttgacaacaa aaagaggaca gctctgacaa aggccgtaca atgccaggaa 480 
gatgaatgtg cgttaatgtt gctggaacat ggcactgatc caaatattcc agatgagtat 540 
ggaaatacca ctctacacta tgctgtctac aatgaagata aattaatggc caaagcactg 600 
ctcttatacg gtgctgatat cgaatcaaaa aacaagcatg gcctcacacc actgctactt 660 
ggtatacatg agcaaaaaca gcaagtggtg aaatttttaa tcaagaaaaa agcgaattta 720 
aatgcgctgg atagatatgg aagaactgct ctcatacttg ctgtatgttg tggatcagca 780 
agtatagtca gccctctact tgagcaaaat gttgatgtat cttctcaaga tctggaaaga 840 
cggccagaga gtatgctgtt tctagtcatc atcatgtaa 879 

<210> 315 
<211> 292 
<212> PRT 
<213> Homo sapiens 

<400> 315 

Met His Leu Ser Phe Pro Ala Phe Leu Pro Pro Trp Met Asp Arg Gly 
5 10 15 

Ser Gly Lys Ser Asn Val Gly Thr Ser Gly Asp His Asn Asp Ser Ser 
20 25 30 

Val Lys Thr Leu Gly Ser Lys Arg Cys Lys Trp Cys Cys His Cys Phe 
35 40 45 

Pro Cys Cys Arg Gly Ser Gly Lys Ser Asn Val Val Ala Trp Gly Asp 
50 55 60 

Tyr Asp Asp Ser Ala Phe Met Asp Pro Arg Tyr His Val His Gly Glu 
65 70 75 80 

Asp Leu Asp Lys Leu His Arg Ala Ala Trp Trp Gly Lys Val Pro Arg 
85 90 95 

Lys Asp Leu lie Val Met Leu Arg Asp Thr Asp Val Asn Lys Arg Asp 
100 105 110 

Lys Gin Lys Arg Thr Ala Leu His Leu Ala Ser Ala Asn Gly Asn Ser 
115 120 125 

Glu Val Val Lys Leu Val Leu Asp Arg Arg Cys Gin Leu Asn Val Leu 
130 135 140 

Asp Asn Lys Lys Arg Thr Ala Leu Thr Lys Ala Val Gin Cys Gin Glu 
145 150 155 160 

Asp Glu Cys Ala Leu Met Leu Leu Glu His Gly Thr Asp Pro Asn lie 
165 170 175 



Pro Asp Glu Tyr Gly Asn Thr Thr Leu His Tyr Ala Val Tyr Asn Glu 
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180 185 190 

Asp Lys Leu Met Ala Lys Ala Leu Leu Leu Tyr Gly Ala Asp lie Glu 
195 200 205 

Ser Lys Asn Lys His Gly Leu Thr Pro Leu Leu Leu Gly lie His Glu 
210 215 220 

Gin Lys Gin Gin Val Val Lys Phe Leu lie Lys Lys Lys Ala Asn Leu 
225 230 235 240 

Asn Ala Leu Asp Arg Tyr Gly Arg Thr Ala Leu lie Leu Ala Val Cys 
245 250 255 

Cys Gly Ser Ala Ser lie Val Ser Pro Leu Leu Glu Gin Asn Val Asp 
260 265 270 

Val Ser Ser Gin Asp Leu Glu Arg Arg Pro Glu Ser Met Leu Phe Leu 
275 280 285 

Val He He Met 
290 

<210> 316 
<211> 584 
<212> DNA 
<213> Homo sapiens 

<400> 316 

agttgggcca aattcccctc cccctacagc ttgaagggga cataaccaat agcctggggt 60 
ttttttgtgg tcctttggag atttctttgc ttattttctt ctgggtgggg gtgattagag 120 
gaggcttatc actaatagga aggggagcta tagggaggct aggatatggg ggtaagctga 180 
gaggtcctcc tgtgggatgt aaatttcaag ctttgcatag tgtattctcc ttcaatgaaa 240 
agaaagcttg gacataaggt atttcactcc atttgccttc cctcttacag aaaaggtcaa 300 
gctgcaggat agtattgtaa tctgtacttc cctcaggtgg ccatttttcc ccatcagaga 360 
gagaatgttg gggccaagcc atagtgcaga aaaaaaaatg agccacctct ttttccaggg 420 
tttgtgggtc aaatttgtcc cattggctta ggatgcattt caaaggtgag cctgttgatg 480 
cctgagtgtt tcccatctga aagacaaaac tgcccatggt tttggtttgt tttgtttctc 540 
cccctgccca agaactatca aactcctgag ccaacaacta aaaa 584 

<210> 317 
<211> 829 
<212> DNA 
<213> Homo sapiens 

<400> 317 

attagcttcc gcttctgaca acactagaga tccctcccct 
acttcatttt tggtacataa catctttata ggacaggggt 
agaatgctta ggactctaac aggtttttga gaatgtgttg 
tttttcttgg tcctccttgt ggtctaggag gacaggcaag 
catcagtaag ggccactaaa tccgaccttc ctcgttcctc 
ctagtgtttc tgttgctgtg tcagtgagca caactattcc 
cactgcaggt tcttgggcag ggggagaaac aaaacaaacc 
ctttcagatg ggaaacactc aggcatcaac aggctcacct 
atgggacaaa tttgacccac aaaccctgga aaaagaggtg 
ggcttggccc caacattctc tctctgatgg ggaaaaatgg 
ttacaatact atcctgcagc ttgacctttt ctgtaagagg 
accttatgtc caagctttct tttcattgaa ggagaataca 



ccctcagggt atggccctcc 60 
aaaatcccaa tactaacagg 120 
gtaagggcca ctcaatccaa 180 
ggtgcagatt ttcaagaatg 240 
cttgtggtct gggaggaaaa 300 
gate age agg gtccagggac 360 
aaaaccatgg gcagttttgt 420 
ttgaaatgea tcctaagcca 480 
gctcattttt tttgeactat 540 
ccacctgagg gaagtacaga 600 
gaaggcaaat ggagtgaaat 660 
etatgeaaag cttgaaattt 720 
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acatcccaca ggaggacctc tcagcttacc cccatatcct agcctcccta tagctcccct 780 
tcctattagt gataagcctc ctctaatcac ccccacccag aagaaaata 829 

<210> 318 

<211> 30 

<212> PRT 

<213> Homo sapien 

<400> 318 

Thr Ala Ala Ser Asp Asn Phe Gin Leu Ser Gin Gly Gly Gin Gly Phe 
15 10 15 

Ala He Pro He Gly Gin Ala Met Ala He Ala Gly Gin He 
20 25 30 



<210> 319 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 
<400> 319 

ggcctctgcc aatgggaact cagaagtagt aaaactcctg c 41 

<210> 320 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 
<400> 320 

gcaggagttt tactacttct gagttcccat tggcagaggc c 41 



<210> 321 
<211> 60 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 
<400> 321 

ggggaattcc cgctggtgcc gcgcggcagc cctatggtgg ttgaggttga 50 
ttccatgccg 60 



<210> 322 
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<211> 42 
<212> DNA 

<213> Artificial Sequnce 



<220> 

<223> PCR primer 
<400> 322 

cccgaattct tatttatttc tggttcttga gacattttct gg 

<210> 323 

<211> 1590 

<212> DNA 

<213> Homo sapiens 



42 



<400> 323 

atgcatcacc 

cagggattcg 

accgttcata 

ggcgcacgag 

ggcgacgtga 

gcgcttaacg 

ggcacgcgta 

ccgcgcggca 

aagccatttg 

agggagagcg 

acactcagga 

ggcaagagca 

aacaagatgg 

aaggtgggcg 

cgtggagaag 

gatctcatcg 

gctctacatc 

cgatgtcaac 

tgccaggaag 

gatgagtatg 

aaagcactgc 

ctgttacttg 

gcgaatttaa 

ggatcagcaa 

ctatctggac 

ttactttctg 

gaaaatgtct 



atcaccatca cacggccgcg 



ccattccgat 
tcgggcctac 
tccaacgcgt 
tcaccgcggt 
ggcatcatcc 
cagggaacgt 
gccctatggt 
gtctcaggag 
gcaagagcaa 
gcaagatggg 
acgtgggcgc 
gcaagtggtg 
cttggggaga 
atctggacaa 
tcatgctcag 
tggcctctgc 
ttaatgtcct 
atgaatgtgc 
gaaataccac 
tcttatatgg 
gtgtacatga 
atgcactgga 
gtatagtcag 
agacggccag 
actacaaaga 
caagaaccag 



cgggcaggcg 
cgccttcctc 
ggtcgggagc 
cgacggcgct 
cggtgacgtc 
gacattggcc 
ggttgaggtt 
caagatgggc 
cgtgggcact 
caagtggtgc 
ttctggagac 
ctgccactgc 
ctacgatgac 
gctccacaga 
ggacactgac 
caatgggaat 
tgacaacaaa 
gttaatgttg 
tctgcactac 
tgctgatatc 
gcaaaaacag 
tagatatgga 
ccttctactt 
agagtatgct 
aaaacagatg 
aaataaataa 



tccgataact 
atggcgatcg 
ggcttgggtg 
gctccggcgg 
ccgatcaact 
atctcggtga 
gagggacccc 
gattccatgc 
aagtggtgct 
tctggagacc 
cgccactgct 
cacgacgact 
ttcccctgct 
agygccttca 
gctgcctggt 
gtgaacaaga 
tcagaagtag 
aagaggacag 
ctggaacatg 
gctatctata 
gaatcaaaaa 
caagtcgtga 
aggactgctc 
gagcaaaata 
gtttctagtc 
ctaaaaatct 



tccagctgtc 
cgggccagat 
ttgtcgacaa 
caagtctcgg 
cggccaccgc 
cctggcaaac 
cggccgaatt 
cggctgcttc 
gccgttgctt 
acgacgactc 
tcccctgctg 
ctgctatgaa 
gcagggggag 
tggagcccag 
ggggtaaagt 
aggacaagca 
taaaactcct 
ctctgataaa 
gcactgatcc 
atgaagataa 
acaagcatgg 
aatttttaat 
tcatacttgc 
ttgatgtatc 
atcatcatgt 
cttctgaaaa 



ccagggtggg 
caagcttccc 
caacggcaac 
catctccacc 
gatggcggac 
caagtcgggc 
cccgctggtg 
ttctgtgaag 
cccctgctgc 
tgctatgaag 
cagggggagt 
gacactcagg 
cggcaagagc 
gtaccacgtc 
ccccagaaag 
aaagaggact 
gctggacaga 
ggccgtacaa 
aaatattcca 
attaatggcc 
cctcacacca 
caagaaaaaa 
tgtatgttgt 
ttctcaagat 
aatttgccag 
cagcaatcca 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1590 



<210> 324 
<211> 529 
<212> PRT 
<213> Homo sapiens 

<400> 324 

Met His His His His His His Thr Ala Ala Ser Asp Asn Phe Gin Leu 
5 10 15 

Ser Gin Gly Gly Gin Gly Phe Ala lie Pro lie Gly Gin Ala Met Ala 
20 25 30 
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He Ala Gly Gin He Lys Leu Pro Thr Val His He Gly Pro Thr Ala 
35 40 45 

Phe Leu Gly Leu Gly Val Val Asp Asn Asn Gly Asn Gly Ala Arg Val 
50 55 60 

Gin Arg Val Val Gly Ser Ala Pro Ala Ala Ser Leu Gly He Ser Thr 
65 70 75 80 

Gly Asp Val He Thr Ala Val Asp Gly Ala Pro He Asn Ser Ala Thr 
85 90 95 

Ala Met Ala Asp Ala Leu Asn Gly His His Pro Gly Asp Val He Ser 
100 105 HO 

Val Thr Trp Gin Thr Lys Ser Gly Gly Thr Arg Thr Gly Asn Val Thr 
115 120 125 

Leu Ala Glu Gly Pro Pro Ala Glu Phe Pro Leu Val Pro Arg Gly Ser 
130 135 140 

Pro Met Val Val Glu Val Asp Ser Met Pro Ala Ala Ser Ser Val Lys 
145 150 155 160 

Lys Pro Phe Gly Leu Arg Ser Lys Met Gly Lys Trp Cys Cys Arg Cys 
165 170 175 

Phe Pro Cys Cys Arg Glu Ser Gly Lys Ser Asn Val Gly Thr Ser Gly 
180 185 190 

Asp His Asp Asp Ser Ala Met Lys Thr Leu Arg Ser Lys Met Gly Lys 
195 200 205 

Trp Cys Arg His Cys Phe Pro Cys Cys Arg Gly Ser Gly Lys Ser Asn 
210 215 220 

Val Gly Ala Ser Gly Asp His Asp Asp Ser Ala Met Lys Thr Leu Arg 
225 230 235 240 

Asn Lys Met Gly Lys Trp Cys Cys His Cys Phe Pro Cys Cys Arg Gly 
245 250 255 

Ser Gly Lys Ser Lys Val Gly Ala Trp Gly Asp Tyr Asp Asp Ser Ala 
260 265 270 

Phe Met Glu Pro Arg Tyr His Val Arg Gly Glu Asp Leu Asp Lys Leu 
275 280 285 

His Arg Ala Ala Trp Trp Gly Lys Val Pro Arg Lys Asp Leu He Val 
290 295 300 

Met Leu Arg Asp Thr Asp Val Asn Lys Lys Asp Lys Gin Lys Arg Thr 
305 310 315 320 

Ala Leu His Leu Ala Ser Ala Asn Gly Asn Ser Glu Val Val Lys Leu 
325 330 335 

Leu Leu Asp Arg Arg Cys Gin Leu Asn Val Leu Asp Asn Lys Lys Arg 
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340 345 350 

Thr Ala Leu lie Lys Ala Val Gin Cys Gin Glu Asp Glu Cys Ala Leu 
355 360 365 

Met Leu Leu Glu His Gly Thr Asp Pro Asn lie Pro Asp Glu Tyr Gly 
370 375 380 

Asn Thr Thr Leu His Tyr Ala lie Tyr Asn Glu Asp Lys Leu Met Ala 
385 390 395 400 

Lys Ala Leu Leu Leu Tyr Gly Ala Asp lie Glu Ser Lys Asn Lys His 
405 410 415 

Gly Leu Thr Pro Leu Leu Leu Gly Val His Glu Gin Lys Gin Gin Val 
420 425 430 

Val Lys Phe Leu lie Lys Lys Lys Ala Asn Leu Asn Ala Leu Asp Arg 
435 440 445 

Tyr Gly Arg Thr Ala Leu lie Leu Ala Val Cys Cys Gly Ser Ala Ser 
450 455 460 

lie Val Ser Leu Leu Leu Glu Gin Asn lie Asp Val Ser Ser Gin Asp 
465 470 475 480 

Leu Ser Gly Gin Thr Ala Arg Glu Tyr Ala Val Ser Ser His His His 
485 490 495 

Val lie Cys Gin Leu Leu Ser Asp Tyr Lys Glu Lys Gin Met Leu Lys 
500 505 510 

lie Ser Ser Glu Asn Ser Asn Pro Glu Asn Val Ser Arg Thr Arg Asn 
515 520 525 

Lys 



<210> 325 
<211> 1155 
<212> DNA 

<213> Homo sapiens 
<400> 325 

atggtggctg aggtttgttc aatgcccact gcctctactg tgaagaagcc atttgatctc 60 

aggagcaaga tgggcaagtg gtgccaccac cgcttcccct gctgcagggg gagcggcaag 120 

agcaacatgg gcacttctgg agaccacgac gactccttta tgaagatgct caggagcaag 180 

atgggcaagt gttgccgcca ctgcttcccc tgctgcaggg ggagcggcac gagcaacgtg 240 

ggcacttctg gagaccatga aaactccttt atgaagatgc tcaggagcaa gatgggcaag 300 

tggtgctgtc actgcttccc ctgctgcagg gggagcggca agagcaacgt gggcgcttgg 360 

ggagactacg accacagcgc cttcatggag ccgaggtacc acatccgtcg agaagatctg 420 

gacaagctcc acagagctgc ctggtggggt aaagtcccca gaaaggatct catcgtcatg 480 

ctcagggaca ctgacatgaa caagagggac aaggaaaaga ggactgctct acatttggcc 540 

tctgccaatg gaaattcaga agtagtacaa ctcctgctgg acagacgatg tcaacttaat 600 

gtccttgaca acaaaaaaag gacagctctg ataaaggcca tacaatgcca ggaagatgaa 660 

tgtgtgttaa tgttgctgga acatggcgct gatcgaaata ttccagatga gtatggaaat 720 

accgctctac actatgctat ctacaatgaa gataaattaa tggccaaagc actgctctta 780 

tatggtgctg atattgaatc aaaaaacaag gttggcctca caccactttt gcttggcgta 840 



WO03013431 r htto://www.qetthe p ^^ 

WO 03/013431 PCT/US02/24917 

106 

catgaacaaa aacagcaagt ggtgaaattt ttaatcaaga aaaaagctaa tttaaatgta 900 
cttgatagat atggaaggac tgccctcata cttgctgtat gttgtggatc agcaagtata 960 
gtcaatcttc tacttgagca aaatgttgat gtatcttctc aagatctatc tggacagacg 1020 
gccagagagt atgctgtttc tagtcatcat catgtaattt gtgaattact ttctgactat 1080 
aaagaaaaac agatgctaaa aatctcttct gaaaacagca atccagaaaa tgtctcaaga 1140 
accagaaata aataa 

<210> 326 
<211> 384 
<212> PRT 
<213> Homo sapiens 

<400> 326 _ _ _ 

Met Val Ala Glu Val Cys Ser Met Pro Thr Ala Ser Thr Val Lys Lys 
5 10 15 

Pro Phe Asp Leu Arg Ser Lys Met Gly Lys Trp Cys His His Arg Phe 
20 25 30 

Pro Cys Cys Arg Gly Ser Gly Lys Ser Asn Met Gly Thr Ser Gly Asp 
35 40 45 

His Asp Asp Ser Phe Met Lys Met Leu Arg Ser Lys Met Gly Lys Cys 
50 55 60 

Cys Arg His Cys Phe Pro Cys Cys Arg Gly Ser Gly Thr Ser Asn Val 
65 70 75 80 

Gly Thr Ser Gly Asp His Glu Asn Ser Phe Met Lys Met Leu Arg Ser 
85 90 95 

Lys Met Gly Lys Trp Cys Cys His Cys Phe Pro Cys Cys Arg Gly Ser 
100 105 HO 

Gly Lys Ser Asn Val Gly Ala Trp Gly Asp Tyr Asp His Ser Ala Phe 
115 120 125 

Met Glu Pro Arg Tyr His He Arg Arg Glu Asp Leu Asp Lys Leu His 
130 135 140 

Ara Ala Ala Trp Trp Gly Lys Val Pro Arg Lys Asp Leu He Val Met 
150 155 160 



145 



Leu Arg Asp Thr Asp Met Asn Lys Arg Asp Lys Glu Lys Arg Thr Ala 
165 170 175 

Leu His Leu Ala Ser Ala Asn Gly Asn Ser Glu Val Val Gin Leu Leu 
180 185 190 

Leu Asp Arg Arg Cys Gin Leu Asn Val Leu Asp Asn Lys Lys Arg Thr 
195 200 205 

Ala Leu He Lys Ala He Gin Cys Gin Glu Asp Glu Cys Val Leu Met 
210 215 220 

Leu Leu Glu His Gly Ala Asp Arg Asn He Pro Asp Glu Tyr Gly Asn 
225 230 235 240 
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Thr Ala Leu His Tyr Ala lie Tyr Asn Glu Asp Lys Leu Met Ala Lys 
245 250 255 

Ala Leu Leu Leu Tyr Gly Ala Asp lie Glu Ser Lys Asn Lys Val Gly 
260 265 270 

Leu Thr Pro Leu Leu Leu Gly Val His Glu Gin Lys Gin Gin Val Val 
275 280 285 

Lys Phe Leu lie Lys Lys Lys Ala Asn Leu Asn Val Leu Asp Arg Tyr 
290 295 300 

Gly Arg Thr Ala Leu He Leu Ala Val Cys Cys Gly Ser Ala Ser He 
305 310 315 320 

Val Asn Leu Leu Leu Glu Gin Asn Val Asp Val Ser Ser Gin Asp Leu 
325 330 335 

Ser Gly Gin Thr Ala Arg Glu Tyr Ala Val Ser Ser His His His Val 
340 345 350 

He Cys Glu Leu Leu Ser Asp Tyr Lys Glu Lys Gin Met Leu Lys He 
355 360 365 

Ser Ser Glu Asn Ser Asn Pro Glu Asn Val Ser Arg Thr Arg Asn Lys 
370 375 380 

<210> 327 

<211> 634 

<212> DNA 

<213> Homo sapiens 

<400> 327 

gactgctcta catctggcct ctgccaatgg aaattcagaa gtagtaaaac tcctgctgga 60 
cagacgatgt caacttaata tccttgacaa caaaaagagg acagctctga caaaggccgt 120 
acaatgccag gaagatgaat gtgcgttaat gttgctggaa catggcactg atccgaatat 180 
tccagatgag tatggaaata ccgctctaca ctatgctatc tacaatgaag ataaattaat 240 
ggccaaagca ctgctcttat acggtgctga tatcgaatca aaaaacaagc atggcctcac 300 
accactgtta cttggtgtac atgagcaaaa acagcaagtg gtgaaatttt taatcaagaa 360 
aaaagcaaat ttaaatgcac tggatagata tggaagaact gctctcatac ttgctgtatg 420 
ttgtggatcg gcaagtatag tcagccttct acttgagcaa aacattgatg tatcttctca 480 
agatctatct ggacagacgg ccagagagta tgctgtttct agtcgtcata atgtaatttg 540 
ccagttactt tctgactaca aagaaaaaca gatactaaaa gtctcttctg aaaacagcaa 600 
tccaggaaat gtctcaagaa ccagaaataa ataa 634 

<210> 328 

<211> 1155 

<212> DNA 

<213> Homo sapiens 

<400> 328 

atggtggttg aggttgattc catgccggct gcctcttctg tgaagaagcc atttggtctc 60 
aggagcaaga tgggcaagtg gtgctgccgt tgcttcccct gctgcaggga gagcggcaag 120 
agcaacgtgg gcacttctgg agaccacgac gactctgcta tgaagacact caggagcaag 180 
atgggcaagt ggtgccgcca ctgcttcccc tgctgcaggg ggagtggcaa gagcaacgtg 240 
ggcgcttctg gagaccacga cgactctgct atgaagacac tcaggaacaa gatgggcaag 300 
tggtgctgcc actgcttccc ctgctgcagg gggagcagca agagcaaggt gggcgcttgg 360 
ggagactacg atgacagtgc cttcatggag cccaggtacc acgtccgtgg agaagatctg 420 
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gacaagctcc acagagctgc ctggtggggt aaagtcccca gaaaggatct catcgtcatg 480 
ctcagggaca ctgacgtgaa caagcaggac aagcaaaaga ggactgctct acatctggcc 540 
tctgccaatg ggaattcaga agtagtaaaa ctcctgctgg acagacgatg tcaacttaat 600 
gtccttgaca acaaaaagag gacagctctg ataaaggccg tacaatgcca ggaagatgaa 660 
tgtgcgttaa tgttgctgga acatggcact gatccaaata ttccagatga gtatggaaat 720 
accactctgc actacgctat ctataatgaa gataaattaa tggccaaagc actgctctta 780 
tatggtgctg atatcgaatc aaaaaacaag catggcctca caccactgtt acttggtgta 840 
catgagcaaa aacagcaagt cgtgaaattt ttaattaaga aaaaagcgaa tttaaatgca 900 
ctggatagat atggaaggac tgctctcata cttgctgtat gttgtggatc agcaagtata 960 
gtcagccttc tacttgagca aaatattgat gtatcttctc aagatctatc tggacagacg 1020 
gccagagagt atgctgtttc tagtcatcat catgtaattt gccagttact ttctgactac 1080 
aaagaaaaac agatgctaaa aatctcttct gaaaacagca atccagaaaa tgtctcaaga 1140 
accagaaata aataa 1155 

<210> 329 
<211> 1155 
<212> DNA 

<213> Homo sapiens 
<400> 329 

atggtggctg aggtttgttc aatgcccgct gcctctgctg tgaagaagcc atttgatctc 60 

aggagcaaga tgggcaagtg gtgccaccac cgcttcccct gctgcagggg gagcggcaag 120 

agcaacatgg gcacttctgg agaccacgac gactccttta tgaagacgct caggagcaag 180 

atgggcaagt gttgccacca ctgcttcccc tgctgcaggg ggagcggcac gagcaatgtg 2 40 

ggcacttctg gagaccatga caactccttt atgaagacac tcaggagcaa gatgggcaag 300 

tggtgctgtc actgcttccc ctgctgcagg gggagcggca agagcaacgt gggcacttgg 360 

ggagactacg acgacagcgc cttcatggag ccgaggtacc acgtccgtcg agaagatctg 420 

gacaagctcc acagagctgc ctggtggggt aaagtcccca gaaaggatct catcgtcatg 480 

ctcagggaca ctgacatgaa caagagggac aagcaaaaga ggactgctct acatttggcc 540 

tctgccaatg gaaattcaga agtagtacaa ctcctgctgg acagacgatg tcaacttaac 600 

gtccttgaca acaaaaaaag gacagctctg ataaaggccg tacaatgcca ggaagatgaa 660 

tgtgtgttaa tgttgctgga acatggcgct gatggaaata ttcaagatga gtatggaaat 720 

accgctctac actatgctat ctacaatgaa gataaattaa tggccaaagc actgctctta 780 

tatggtgctg atattgaatc aaaaaacaag tgtggcctca caccactttt gcttggcgta 840 

catgaacaaa aacagcaagt ggtgaaattt ttaatcaaga aaaaagctaa tttaaatgca 900 

cttgatagat atggaagaac tgccctcata cttgctgtat gttgtggatc agcaagtata 960 

gtcaatcttc tacttgagca aaatgttgat gtatcttctc aagatctatc tggacagacg 1020 

gccagagagt atgctgtttc tagtcatcat catgtaattt gtgaattact ttctgactat 1080 

aaagaaaaac agatgctaaa aatctcttct gaaaacagca atccagaaaa tgtctcaaga 1140 

accagaaata aataa 1155 

<210> 330 
<211> 1155 
<212> DNA 

<213> Homo sapiens 
<400> 330 

atggtggctg aggtttgttc aatgcccact gcctctactg tgaagaagcc atttgatctc 60 
aggagcaaga tgggcaagtg gtgccaccac cgcttcccct gctgcagggg gagcggcaag 120 
agcaacatgg gcacttctgg agaccacgac gactccttta tgaagatgct caggagcaag 180 
atgggcaagt gttgccgcca ctgcttcccc tgctgcaggg ggagcggcac gagcaacgtg 240 
ggcacttctg gagaccatga aaactccttt atgaagatgc tcaggagcaa gatgggcaag 300 
tggtgctgtc actgcttccc ctgctgcagg gggagcggca agagcaacgt gggcgcttgg 360 
ggagactacg accacagcgc cttcatggag ccgaggtacc acatccgtcg agaagatctg 420 
gacaagctcc acagagctgc ctggtggggt aaagtcccca gaaaggatct catcgtcatg 480 
ctcagggaca ctgacatgaa caagagggac aaggaaaaga ggactgctct acatttggcc 540 
tctgccaatg gaaattcaga agtagtacaa ctcctgctgg acagacgatg tcaacttaat 600 
gtccttgaca acaaaaaaag gacagctctg ataaaggcca tacaatgcca ggaagatgaa 660 
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tgtgtgttaa tgttgctgga acatggcgct gatcgaaata ttccagatga gtatggaaat 720 

accgctctac actatgctat ctacaatgaa gataaattaa tggccaaagc actgctctta 780 

tatggtgctg atattgaatc aaaaaacaag tgtggcctca caccactttt gcttggcgta 840 

catgaacaaa aacagcaagt ggtgaaattt ttaatcaaga aaaaagctaa tttaaatgta 900 

cttgatagat atggaagaac tgccctcata cttgctgtat gttgtggatc agcaagtata 960 

gtcaatcttc tacttgagca aaatgttgat gtatcttctc aagatctatc tggacagacg 1020 

gccagagagt atgctgtttc tagtcatcat catgtaattt gtgaattact ttctgactat 1080 

aaagaaaaac agatgctaaa aatctcttct gaaaacagca atccagaaaa tgtctcaaga 1140 

accagaaata aataa 1155 

<210> 331 
<211> 210 
<212> PRT 

<213> Homo sapiens 
<400> 331 

Thr Ala Leu His Leu Ala Ser Ala Asn Gly Asn Ser Glu Val Val Lys 
5 10 15 

Leu Leu Leu Asp Arg Arg Cys Gin Leu Asn lie Leu Asp Asn Lys Lys 
20 25 30 

Arg Thr Ala Leu Thr Lys Ala Val Gin Cys Gin Glu Asp Glu Cys Ala 
35 40 45 

Leu Met Leu Leu Glu His Gly Thr Asp Pro Asn lie Pro Asp Glu Tyr 
50 55 60 

Gly Asn- Thr Ala Leu His Tyr Ala lie Tyr Asn Glu Asp Lys Leu Met 
65 70 75 80 

Ala Lys Ala Leu Leu Leu Tyr Gly Ala Asp lie Glu Ser Lys Asn Lys 
85 90 95 

His Gly Leu Thr Pro Leu Leu Leu Gly Val His Glu Gin Lys Gin Gin 
100 105 110 

Val Val Lys Phe Leu lie Lys Lys Lys Ala Asn Leu Asn Ala Leu Asp 
115 120 125 

Arg Tyr Gly Arg Thr Ala Leu lie Leu Ala Val Cys Cys Gly Ser Ala 
130 135 140 

Ser lie Val Ser Leu Leu Leu Glu Gin Asn lie Asp Val Ser Ser Gin 
145 150 155 160 

Asp Leu Ser Gly Gin Thr Ala Arg Glu Tyr Ala Val Ser Ser Arg His 
165 170 175 

Asn Val lie Cys Gin Leu Leu Ser Asp Tyr Lys Glu Lys Gin lie Leu 
180 185 190 

Lys Val Ser Ser Glu Asn Ser Asn Pro Gly Asn Val Ser Arg Thr Arg 
195 200 205 



Asn Lys 
210 
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<210> 332 
<211> 384 
<212> PRT 
<213> Homo sapiens 

<400> 332 

Met Val Ala Glu Val Cys Ser Met Pro Thr Ala Ser Thr Val Lys Lys 
5 10 15 

pro Phe Asp Leu Arg Ser Lys Met Gly Lys Trp Cys His His Arg Phe 
20 25 30 

pro Cys Cys Arg Gly Ser Gly Lys Ser Asn Met Gly Thr Ser Gly Asp 
35 40 45 

His Asp Asp Ser Phe Met Lys Met Leu Arg Ser Lys Met Gly Lys Cys 
50 55 60 

Cys Arg His Cys Phe Pro Cys Cys Arg Gly Ser Gly Thr Ser Asn Val 
65 70 75 80 

Gly Thr Ser Gly Asp His Glu Asn Ser Phe Met Lys Met Leu Arg Ser 
85 9° 95 

Lys Met Gly Lys Trp Cys Cys His Cys Phe Pro Cys Cys Arg Gly Ser 
100 105 H° 

Gly Lys Ser Asn Val Gly Ala Trp Gly Asp Tyr Asp His Ser Ala Phe 
115 120 125 

Met Glu Pro Arg Tyr His He Arg Arg Glu Asp Leu Asp Lys Leu His 
130 135 1*0 

Ara Ala Ala Trp Trp Gly Lys Val Pro Arg Lys Asp Leu He Val Met 
145 150 155 160 

Leu Arg Asp Thr Asp Met Asn Lys Arg Asp Lys Glu Lys Arg Thr Ala 
165 170 175 

Leu His Leu Ala Ser Ala Asn Gly Asn Ser Glu Val Val Gin Leu Leu 
180 185 19° 

Leu Asp Arg Arg Cys Gin Leu Asn Val Leu Asp Asn Lys Lys Arg Thr 
195 200 205 

Ala Leu He Lys Ala He Gin Cys Gin Glu Asp Glu Cys Val Leu Met 
210 215 220 

Leu Leu Glu His Gly Ala Asp Arg Asn He Pro Asp Glu Tyr Gly Asn 
225 230 235 240 

Thr Ala Leu His Tyr Ala He Tyr Asn Glu Asp Lys Leu Met Ala Lys 
245 250 255 

Ala Leu Leu Leu Tyr Gly Ala Asp He Glu Ser Lys Asn Lys Cys Gly 
260 265 270 
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Leu Thr Pro Leu Leu Leu Gly Val His Glu Gin Lys Gin Gin Val Val 
275 280 285 

Lys Fhe Leu lie Lys Lys Lys Ala Asn Leu Asn Val Leu Asp Arg Tyr 
290 295 300 

Gly Arg Thr Ala Leu lie Leu Ala Val Cys Cys Gly Ser Ala Ser lie 
305 310 315 320 

Val Asn Leu Leu Leu Glu Gin Asn Val Asp Val Ser Ser Gin Asp Leu 
325 330 335 

Ser Gly Gin Thr Ala Arg Glu Tyr Ala Val Ser Ser His His His Val 
340 345 350 

lie Cys Glu Leu Leu Ser Asp Tyr Lys Glu Lys Gin Met Leu Lys lie 
355 360 365 

Ser Ser Glu Asn Ser Asn Pro Glu Asn Val Ser Arg Thr Arg Asn Lys 
370 375 380 



<210> 333 
<211> 384 
<212> PRT 

<213> Homo sapiens 
<400> 333 

Met Val Ala Glu Val Cys Ser Met Pro Ala Ala Ser Ala Val Lys Lys 
5 10 15 

Pro Phe Asp Leu Arg Ser Lys Met Gly Lys Trp Cys His His Arg Phe 
20 25 30 

Pro Cys Cys Arg Gly Ser Gly Lys Ser Asn Met Gly Thr Ser Gly Asp 
35 40 45 

His Asp Asp Ser Phe Met Lys Thr Leu Arg Ser Lys Met Gly Lys Cys 
50 55 60 

Cys His His Cys Phe Pro Cys Cys Arg Gly Ser Gly Thr Ser Asn Val 
65 70 75 80 

Gly Thr Ser Gly Asp His Asp Asn Ser Phe Met Lys Thr Leu Arg Ser 
85 90 95 

Lys Met Gly Lys Trp Cys Cys His Cys Phe Pro Cys Cys Arg Gly Ser 
100 105 110 

Gly Lys Ser Asn Val Gly Thr Trp Gly Asp Tyr Asp Asp Ser Ala Phe 
115 120 125 

Met Glu Pro Arg Tyr His Val Arg Arg Glu Asp Leu Asp Lys Leu His 
130 135 140 

Arg Ala Ala Trp Trp Gly Lys Val Pro Arg Lys Asp Leu lie Val Met 
145 150 155 160 
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Leu Arg Asp Thr Asp Met Asn Lys Arg Asp Lys Gin Lys Arg Thr Ala 
165 170 175 

Leu His Leu Ala Ser Ala Asn Gly Asn Ser Glu Val Val Gin Leu Leu 
180 185 190 

Leu Asp Arg Arg Cys Gin Leu Asn Val Leu Asp Asn Lys Lys Arg Thr 
195 200 205 

Ala Leu He Lys Ala Val Gin Cys Gin Glu Asp Glu Cys Val Leu Met 
210 215 220 

Leu Leu Glu His Gly Ala Asp Gly Asn He Gin Asp Glu Tyr Gly Asn 
225 230 235 240 

Thr Ala Leu His Tyr Ala He Tyr Asn Glu Asp Lys Leu Met Ala Lys 
245 250 255 

Ala Leu Leu Leu Tyr Gly Ala Asp He Glu Ser Lys Asn Lys Cys Gly 
260 265 270 

Leu Thr Pro Leu Leu Leu Gly Val His Glu Gin Lys Gin Gin Val Val 
275 280 285 

Lys Phe Leu He Lys Lys Lys Ala Asn Leu Asn Ala Leu Asp Arg Tyr 
290 295 300 

Gly Arg Thr Ala Leu He Leu Ala Val Cys Cys Gly Ser Ala Ser He 
305 310 315 320 

Val Asn Leu Leu Leu Glu Gin Asn Val Asp Val Ser Ser Gin Asp Leu 
325 330 335 

Ser Gly Gin Thr Ala Arg Glu Tyr Ala Val Ser Ser His His His Val 
340 345 350 

He Cys Glu Leu Leu Ser Asp Tyr Lys Glu Lys Gin Met Leu Lys He 
355 360 365 

Ser Ser Glu Asn Ser Asn Pro Glu Asn Val Ser Arg Thr Arg Asn Lys 
370 375 380 



<210> 334 
<211> 384 
<212> PRT 
<213> Homo sapiens 

<400> 334 

Met Val Val Glu Val Asp Ser Met Pro Ala Ala Ser Ser Val Lys Lys 
5 10 15 

Pro Phe Gly Leu Arg Ser Lys Met Gly Lys Trp Cys Cys Arg Cys Phe 
20 25 30 

Pro Cys Cys Arg Glu Ser Gly Lys Ser Asn Val Gly Thr Ser Gly Asp 
35 40 45 
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His Asp Asp Ser Ala Met Lys Thr Leu Arg Ser Lys Met Gly Lys Trp 
50 55 60 

Cys Arg His Cys Phe Pro Cys Cys Arg Gly Ser Gly Lys Ser Asn Val 
65 70 75 80 

Gly Ala Ser Gly Asp His Asp Asp Ser Ala Met Lys Thr Leu Arg Asn 
85 90 95 

Lys Met Gly Lys Trp Cys Cys His Cys Phe Pro Cys Cys Arg Gly Ser 
100 105 110 

Ser Lys Ser Lys Val Gly Ala Trp Gly Asp Tyr Asp Asp Ser Ala Phe 
115 120 125 

Met Glu Pro Arg Tyr His Val Arg Gly Glu Asp Leu Asp Lys Leu His 
130 135 140 

Arg Ala Ala Trp Trp Gly Lys Val Pro Arg Lys Asp Leu lie Val Met 
145 150 155 160 

Leu Arg Asp Thr Asp Val Asn Lys Gin Asp Lys Gin Lys Arg Thr Ala 
165 170 175 

Leu His Leu Ala Ser Ala Asn Gly Asn Ser Glu Val Val Lys Leu Leu 
180 185 190 

Leu Asp Arg Arg Cys Gin Leu Asn Val Leu Asp Asn Lys Lys Arg Thr 
195 200 205 

Ala Leu lie Lys Ala Val Gin Cys Gin Glu Asp Glu Cys Ala Leu Met 
210 215 220 

Leu Leu Glu His Gly Thr Asp Pro Asn lie Pro Asp Glu Tyr Gly Asn 
225 230 235 240 

Thr Thr Leu His Tyr Ala lie Tyr Asn Glu Asp Lys Leu Met Ala Lys 
245 250 255 

Ala Leu Leu Leu Tyr Gly Ala Asp He Glu Ser Lys Asn Lys His Gly 
260 265 270 

Leu Thr Pro Leu Leu Leu Gly Val His Glu Gin Lys Gin Gin Val Val 
275. 280 285 

Lys Phe Leu He Lys Lys Lys Ala Asn Leu Asn Ala Leu Asp Arg Tyr 
290 295 300 

Gly Arg Thr Ala Leu He Leu Ala Val Cys Cys Gly Ser Ala Ser He 
305 310 315 320 

Val Ser Leu Leu Leu Glu Gin Asn He Asp Val Ser Ser Gin Asp Leu 
325 330 335 

Ser Gly Gin Thr Ala Arg Glu Tyr Ala Val Ser Ser His His His Val 
340 345 350 

He Cys Gin Leu Leu Ser Asp Tyr Lys Glu Lys Gin Met Leu Lys He 
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355 



360 



365 



Ser Ser Glu Asn Ser Asn Pro Glu Asn Val Ser Arg Thr Arg Asn Lys 
370 375 380 



<210> 335 

<211> 1185 

<212> DNA 

<213> Homo sapiens 



<400> 335 

atggtggttg 

aggagcaaga 

agcaacgtgg 

atgggcaagt 

ggcgcttctg 

tggtgctgcc 

ggagactacg 

gacaagctcc 

ctcagggaca 

tctgccaatg 

gtccttgaca 

tgtgcgttaa 

accactctgc 

tatggtgctg 

catgagcaaa 

ctggatagat 

gtcagccttc 

gccagagagt 

aaagaaaaac 

accagaaata 



aggttgattc 
tgggcaagtg 
gcacttctgg 
ggtgccgcca 
gagaccacga 
actgcttccc 
atgacagtgc 
acagagctgc 
ctgacgtgaa 
ggaattcaga 
acaaaaagag 
tgttgctgga 
actacgctat 
atatcgaatc 
aacagcaagt 
atggaaggac 
tacttgagca 
atgctgtttc 
agatgctaaa 
aacatcatca 



catgccggct 
gtgctgccgt 
agaccacgac 
ctgcttcccc 
cgactctgct 
ctgctgcagg 
cttcatggag 
ctggtggggt 
caagaaggac 
agtagtaaaa 
gacagctctg 
acatggcact 
ctataatgaa 
aaaaaacaag 
cgtgaaattt 
tgctctcata 
aaatattgat 
tagtcatcat 
aatctcttct 
ccatcatcat 



gcctcttctg 
tgcttcccct 
gactctgcta 
tgctgcaggg 
atgaagacac 
gggagcggca 
cccaggtacc 
aaagtcccca 
aagcaaaaga 
ctcctgctgg 
ataaaggccg 
gatccaaata 
gataaattaa 
catggcctca 
ttaatcaaga 
cttgctgtat 
gtatcttctc 
catgtaattt 
gaaaacagca 
caccatcacc 



tgaagaagcc 
gctgcaggga 
tgaagacact 
ggagtggcaa 
tcaggaacaa 
agagcaaggt 
acgtccgtgg 
gaaaggatct 
ggactgctct 
acagacgatg 
tacaatgcca 
ttccagatga 
tggccaaagc 
caccactgtt 
aaaaagcgaa 
gttgtggatc 
aagatctatc 
gccagttact 
atccagaaaa 
attaa 



atttggtctc 
gagcggcaag 
caggagcaag 
gagcaacgtg 
gatgggcaag 
gggcgcttgg 
agaagatctg 
catcgtcatg 
acatctggcc 
tcaacttaat 
ggaagatgaa 
gtatggaaat 
actgctctta 
acttggtgta 
tttaaatgca 
agcaagtata 
tggacagacg 
ttctgactac 
tgtctcaaga 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1185 



<210> 336 
<211> 394 
<212> PRT 

<213> Homo sapiens 
<400> 336 

Met Val Val Glu Val Asp Ser Met Pro Ala Ala Ser Ser Val Lys Lys 
5 10 15 

Pro Phe Gly Leu Arg Ser Lys Met Gly Lys Trp Cys Cys Arg Cys Phe 
20 25 30 

Pro Cys Cys Arg Glu Ser Gly Lys Ser Asn Val Gly Thr Ser Gly Asp 
35 40 45 

His Asp Asp Ser Ala Met Lys Thr Leu Arg Ser Lys Met Gly Lys Trp 
50 55 60 

Cys Arg His Cys Phe Pro Cys Cys Arg Gly Ser Gly Lys Ser Asn Val 
65 70 75 80 



Gly Ala Ser Gly Asp His Asp Asp Ser Ala Met Lys Thr Leu Arg Asn 
85 90 95 
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Lys Met Gly Lys Trp Cys Cys His Cys Phe Pro Cys Cys Arg Gly Ser 
100 105 110 

Gly Lys Ser Lys Val Gly Ala Trp Gly Asp Tyr Asp Asp Ser Ala Phe 
115 120 125 

Met Glu Pro Arg Tyr His Val Arg Gly Glu Asp Leu Asp Lys Leu His 
130 135 140 

Arg Ala Ala Trp Trp Gly Lys Val Pro Arg Lys Asp Leu lie Val Met 
145 150 155 160 

Leu Arg Asp Thr Asp Val Asn Lys Lys Asp Lys Gin Lys Arg Thr Ala 
165 170 175 

Leu His Leu Ala Ser Ala Asn Gly Asn Ser Glu Val Val Lys Leu Leu 
180 185 190 

Leu Asp Arg Arg Cys Gin Leu Asn Val Leu Asp Asn Lys Lys Arg Thr 
195 200 205 

Ala Leu lie Lys Ala Val Gin Cys Gin Glu Asp Glu Cys Ala Leu Met 
210 215 220 

Leu Leu Glu His Gly Thr Asp Pro Asn lie Pro Asp Glu Tyr Gly Asn 
225 230 235 . 240 

Thr Thr Leu His Tyr Ala lie Tyr Asn Glu Asp Lys Leu Met Ala Lys 
245 250 255 

Ala Leu Leu Leu Tyr Gly Ala Asp lie Glu Ser Lys Asn Lys His Gly 
260 265 270 

Leu Thr Pro Leu Leu Leu Gly Val His Glu Gin Lys Gin Gin Val Val 
275 280 285 

Lys Phe Leu lie Lys Lys Lys Ala Asn Leu Asn Ala Leu Asp Arg Tyr 
290 295 300 

Gly Arg Thr Ala Leu lie Leu Ala Val Cys Cys Gly Ser Ala Ser lie 
305 310 315 320 

Val Ser Leu Leu Leu Glu Gin Asn lie Asp Val Ser Ser Gin Asp Leu 
325 330 335 

Ser Gly Gin Thr Ala Arg Glu Tyr Ala Val Ser Ser His His His Val 
340 345 350 

lie Cys Gin Leu Leu Ser Asp Tyr Lys Glu Lys Gin Met Leu Lys lie 
355 360 365 

Ser Ser Glu Asn Ser Asn Pro Glu Asn Val Ser Arg Thr Arg Asn Lys 
370 375 380 



His His His His His His His His His His 
385 390 
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<210> 337 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 
<400> 337 

cggcggatcc accatggtgg ttgaggttga ttcc 34 

<210> 338 
<211> 74 
<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> PCR primer 
<4O0> 338 

cggctctaga ttaatggtga tggtgatgat gatggtgatg atgtttattt ctggttcttg 60 
agacattttc tgga 74 



<210> 339 

<211> 1166 

<212> DNA 

<213> Homo sapiens 

<400> 339 

atggtggctg aggctggttc aatgccggct gcctcctctg tgaagaagcc atttggtctc 60 

agaagcaaga tgggcaagtg gtgccgccac tgcttcccct ggtgcagggg gagcggcaag 120 

agcaacgtgg gcacttctgg agaccacgac gattctgcta tgaagacact caggagcaag 180 

atgggcaagt ggtgccgcca ctgcttcccc tggtgcaggg ggagcagcaa gagcaacgtg 240 

ggcacttctg gagaccacga cgactctgct atgaagacac tcaggagcaa gatgggcaag 300 

tggtgctgcc actgcttccc ctgctgcagg gggagcggca agagcaaagt gggcccttgg 360 

ggagactacg acgacagcgc tttcatggag ccgaggtacc acgtccgtcg agaagatctg 420 

gacaagctcc acagagctgc ctggtggggt aaagtcccca gaaaggatct catcgtcatg 480 

ctcaaggaca ctgacatgaa caagaaggac aagcaaaaga ggactgctct acatctggcc 540 

tctgccaatg gaaattcaga agtagtaaaa ctcctgctgg acagacgatg tcaacttaat 600 

atccttgaca acaaaaagag gacagctctg acaaaggccg tacaatgccg ggaagatgaa 660 

tgtgcgttaa tgttgctgga acatggcact gatccgaata ttccagatga gtatggaaat 720 

accgctctac actatgctat ctacaatgaa gataaattaa tggccaaagc actgctctta 780 

tacggtgctg atatcgaatc aaaaaacaag catggcctca caccactgtt acttggtgta 840 

catgagcaaa aacagcaagt ggtgaaattc ttaatcaaga aaaaagcaaa tttaaatgca 900 

ctggatagat atggaagaac tgctctcata cttgctgtat gttgtggatc ggcaagtata 960 

gtcagccttc tacttgagca aaacattgat gtatcttctc aagatctatc tggacagacg 1020 

gccagagagt atgctgtttc tagtcatcat aatgtaattt gccagttact ttctgactac 1080 

aaagaaaaac agatgctaaa agtctcttct gaaaacagca atccaggaaa tgtctcaaga 1140 
accagaaata aataagggtg gtgata 1166 

<210> 340 
<211> 384 
<212> PRT 
<213> Homo sapiens 
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<400> 340 

Met Val Ala Glu Ala Gly Ser Met Pro Ala Ala Ser Ser Val Lys Lys 
5 10 15 

Pro Phe Gly Leu Arg Ser Lys Met Gly Lys Trp Cys Arg His Cys Phe 
20 25 30 

Pro Trp Cys Arg Gly Ser Gly Lys Ser Asn Val Gly Thr Ser Gly Asp 
35 40 45 

His Asp Asp Ser Ala Met Lys Thr Leu Arg Ser Lys Met Gly Lys Trp 
50 55 60 

Cys Arg His Cys Phe Pro Trp Cys Arg Gly Ser Ser Lys Ser Asn Val 
65 70 75 80 

Gly Thr Ser Gly Asp His Asp Asp Ser Ala Met Lys Thr Leu Arg Ser 
85 90 95 

Lys Met Gly Lys Trp Cys Cys His Cys Phe Pro Cys Cys Arg Gly Ser 
100 105 110 

Gly Lys Ser Lys Val Gly Pro Trp Gly Asp Tyr Asp Asp Ser Ala Phe 
115 120 125 

Met Glu Pro Arg Tyr His Val Arg Arg Glu Asp Leu Asp Lys Leu His 
130 135 140 

Arg Ala Ala Trp Trp Gly Lys Val Pro Arg Lys Asp Leu lie Val Met 
145 150 155 160 

Leu Lys Asp Thr Asp Met Asn Lys Lys Asp Lys Gin Lys Arg Thr Ala 
165 170 175 

Leu His Leu Ala Ser Ala Asn Gly Asn Ser Glu Val Val Lys Leu Leu 
180 185 190 

Leu Asp Arg Arg Cys Gin Leu Asn lie Leu Asp Asn Lys Lys Arg Thr 
195 200 205 

Ala Leu Thr Lys Ala Val Gin Cys Arg Glu Asp Glu Cys Ala Leu Met 
210 215 220 

Leu Leu Glu His Gly Thr Asp Pro Asn lie Pro Asp Glu Tyr Gly Asn 
225 230 235 240 

Thr Ala Leu His Tyr Ala lie Tyr Asn Glu Asp Lys Leu Met Ala Lys 
245 250 255 

Ala Leu Leu Leu Tyr Gly Ala Asp lie Glu Ser Lys Asn Lys His Gly 
260 265 270 

Leu Thr Pro Leu Leu Leu Gly Val His Glu Gin Lys Gin Gin Val Val 
275 280 285 



Lys Phe Leu lie Lys Lys Lys Ala Asn Leu Asn Ala 
290 295 300 



Leu Asp Arg Tyr 
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Gly Arg Thr Ala Leu He Leu Ala Val Cys Cys Gly Ser Ala Ser He 
305 310 315 320 

Val Ser Leu Leu Leu Glu Gin Asn He Asp Val Ser Ser Gin Asp Leu 
325 330 335 

Ser Gly Gin Thr Ala Arg Glu Tyr Ala Val Ser Ser His His Asn Val 
340 345 350 

He Cys Gin Leu Leu Ser Asp Tyr Lys Glu Lys Gin Met Leu Lys Val 
355 360 365 

Ser Ser Glu Asn Ser Asn Pro Gly Asn Val Ser Arg Thr Arg Asn Lys 
370 375 380 



<210> 341 
<211> 876 
<212> DNA 
<213> Homo sapiens 

<400> 341 

atgcatcttt catttcctgc atttcttcct ccctggatgg acagggggag cggcaagagc 60 
aacgtgggca cttctggaga ccacaacgac tcctctgtga agacgcttgg gagcaagagg 120 
tgcaagtggt gctgccactg cttcccctgc tgcaggggga gcggcaagag caacgtggtc 180 
gcttggggag actacgatga cagcgccttc atggatccca ggtaccacgt ccatggagaa 240 
gatctggaca agctccacag agctgcctgg tggggtaaag tccccagaaa ggatctcatc 300 
gtcatgctca gggacacgga tgtgaacaag agggacaagc aaaagaggac tgctctacat 360 
ctggcctctg ccaatgggaa ttcagaagta gtaaaactcg tgctggacag acgatgtcaa 420 
cttaatgtcc ttgacaacaa aaagaggaca gctctgacaa aggccgtaca atgccaggaa 480 
gatgaatgtg cgttaatgtt gctggaacat ggcactgatc caaatattcc agatgagtat 540 
ggaaatacca ctctacacta tgctgtctac aatgaagata aattaatggc caaagcactg 600 
ctcttatacg gtgctgatat cgaatcaaaa aacaagcatg gcctcacacc actgctactt 660 
ggtatacatg agcaaaaaca gcaagtggtg aaatttttaa tcaagaaaaa agcgaattta 720 
aatgcgctgg atagatatgg aagaactgct ctcatacttg ctgtatgttg tggatcagca 780 
agtatagtca gccctctact tgagcaaaat gttgatgtat cttctcaaga tctggaaaga 840 
cggccagaga gtatgctgtt tctagtcatc atcatg 876 

<210> 342 
<211> 876 
<212> DNA 
<213> Homo sapiens 

<400> 342 

atgcatcttt catttcctgc atttcttcct ccctggatgg acagggggag cggcaagagc 60 
aacgtgggca cttctggaga ccacaacgac tcctctgtga agacgcttgg gagcaagagg 120 
tgcaagtggt gctgccactg cttcccctgc tgcaggggga gcggcaagag caacgtgggc 180 
gcttggggag actacgatga cagcgccttc atggatccca ggtaccacgt ccatggagaa 240 
gatctggaca agctccacag agctgcctgg tggggtaaag tccccagaaa ggatctcatc 300 
gtcatgctca gggacactga tgtgaacaag agggacaagc aaaagaggac tgctctacat 360 
ctggcctctg ccaatgggaa ttcagaagta gtaaaactcg tgctggacag acgatgtcaa 420 
cttaatgtcc ttgacaacaa aaagaggaca gctctgacaa aggccgtaca atgccaggaa 480 
gatgaatgtg cgttaatgtt gctggaacat ggcactgatc caaatattcc agatgagtat 540 
ggaaatacca ctctacacta tgctgtctac aatgaagata aattaatggc caaagcactg 600 
ctcttatacg gtgctgatat cgaatcaaaa aacaagcatg gcctcacacc actgctactt 660 
ggtatacatg agcaaaaaca gcaagtggtg aaatttttaa tcaagaaaaa agcgaattta 720 
aatgcgctgg atagatatgg aagaactgct ctcatacttg ctgtatgttg tggatcagca 780 
agtatagtca gccctctact tgagcaaaat gttgatgtat cttctcaaga tctggaaaga 840 
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cggccagaga gtatgctgtt tctagtcatc atcatg 



876 



<210> 343 
<211> 933 
<212> DNA 

<213> Homo sapiens 



<400> 343 

atggtggttg 

aggagcaaga 

gtgggcactt 

aagtggtgct 

tggggagact 

ctggacaagc 

atgctcaggg 

gcctctgcca 

aatgtccttg 

gaatgtgcgt 

aataccactc 

ttatacggtg 

atacatgagc 

gcgctggata 

atagtcagcc 

ccagagagta 



aggttgattc 
tgggcaagtg 
ctggagacca 
gccactgctt 
acgatgacag 
tccacagagc 
acactgatgt 
atgggaattc 
acaacaaaaa 
taatgttgct 
tacactatgc 
ctgatatcga 
aaaaacagca 
gatatggaag 
ctctacttga 
tgctgtttct 



aatgccggct 
gtgctgcttt 
caacgactcc 
cccctgctgc 
cgccttcatg 
tgcctggtgg 
gaacaagagg 
agaagtagta 
gaggacagct 
ggaacatggc 
tgtctacaat 
atcaaaaaac 
agtggtgaaa 
aactgctctc 
gcaaaatgtt 
agtcatcatc 



gcctcttctg 
ccctgctgca 
tctgtgaaga 
agggggagcg 
gatcccaggt 
ggtaaagtcc 
gacaagcaaa 
aaactcgtgc 
ctgacaaagg 
actgatccaa 
gaagataaat 
aagcatggcc 
tttttaatca 
atacttgctg 
gatgtatctt 
atg 



tgaagaagcc 
gggggagcgg 
cgcttgggag 
gcaagagcaa 
accacgtcca 
ccagaaagga 
agaggactgc 
tggacagacg 
ccgtacaatg 
atattccaga 
taatggccaa 
tcacaccact 
agaaaaaagc 
tatgttgtgg 
ctcaagatct 



atttggtctc 60 
caagagcaac 120 
caagaggtgc 180 
cgtgggcgct 240 
tggagaagat 300 
tctcatcgtc 360 
tctacatctg 420 
atgtcaactt 480 
ccaggaagat 540 
tgagtatgga 600 
agcactgctc 660 
gctacttggt 720 
gaatttaaat 780 
atcagcaagt 840 
ggaaagacgg 900 
933 



<210> 344 
<211> 939 
<212> DNA 
<213> Homo sapiens 



<400> 344 

atggtggttg 

aggagcaaga 

agcaacgtgg 

aggtgcaagt 

gtcgcttggg 

gaagatctgg 

atcgtcatgc 

catctggcct 

caacttaatg 

gaagatgaat 

tatggaaata 

ctgctcttat 

cttggtatac 

ttaaatgcgc 

gcaagtatag 

agacggccag 



aggttgattc 
tgggcaagtg 
gcacttctgg 
ggtgctgcca 
gagactacga 
acaagctcca 
tcagggacac 
ctgccaatgg 
tccttgacaa 
gtgcgttaat 
ccactctaca 
acggtgctga 
atgagcaaaa 
tggatagata 
tcagccctct 
agagtatgct 



aatgccggct 
gtgctgccac 
agaccacaac 
ctgcttcccc 
tgacagcgcc 
cagagctgcc 
ggatgtgaac 
gaattcagaa 
caaaaagagg 
gttgctggaa 
ctatgctgtc 
tatcgaatca 
acagcaagtg 
tggaagaact 
acttgagcaa 
gtttctagtc 



gcctcttctg 
tgctttccct 
gactcctctg 
tgctgcaggg 
ttcatggatc 
tggtggggta 
aagagggaca 
gtagtaaaac 
acagctctga 
catggcactg 
tacaatgaag 
aaaaacaagc 
gtgaaatttt 
gctctcatac 
aatgttgatg 
ate atcatg 



tgaagaagcc 
getgeagggg 
tgaagacget 
ggagcggcaa 
ccaggtacca 
aagtccccag 
agcaaaagag 
tcgtgctgga 
caaaggcegt 
atccaaatat 
ataaattaat 
atggcctcac 
taatcaagaa 
ttgctgtatg 
tatcttctca 



atttggtctc 60 
gageggcaag 120 
tgggagcaag 180 
gagcaaegtg 240 
cgtccatgga 300 
aaaggatctc 360 
gaetgetcta 420 
cagacgatgt 480 
acaatgecag 540 
tccagatgag 600 
ggecaaagea 660 
accactgcta 720 
aaaagcgaat 780 
ttgtggatca 840 
agatctggaa 900 
939 



<210> 345 
<211> 292 
<212> PRT 

<213> Homo sapiens 



<400> 345 

Met His Leu Ser Phe Pro Ala Phe Leu Pro Pro Trp Met Asp Arg Gly 
5 10 15 



Ser Gly Lys Ser Asn Val Gly Thr Ser Gly Asp His Asn Asp Ser Ser 
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20 25 30 

Val Lys Thr Leu Gly Ser Lys Arg Cys Lys Trp Cys Cys His Cys Phe 
35 40 45 

Pro Cys Cys Arg Gly Ser Gly Lys Ser Asn Val Val Ala Trp Gly Asp 
50 55 60 

Tyr Asp Asp Ser Ala Phe Met Asp Pro Arg Tyr His Val His Gly Glu 
6 5 70 75 80 

Asp Leu Asp Lys Leu His Arg Ala Ala Trp Trp Gly Lys Val Pro Arg 
85 90 95 

Lys Asp Leu He Val Met Leu Arg Asp Thr Asp Val Asn Lys Arg Asp 
100 105 HO 

Lys Gin Lys Arg Thr Ala Leu His Leu Ala Ser Ala Asn Gly Asn Ser 
115 120 125 

Glu Val Val Lys Leu Val Leu Asp Arg Arg Cys Gin Leu Asn Val Leu 
130 135 140 

Asp Asn Lys Lys Arg Thr Ala Leu Thr Lys Ala Val Gin Cys Gin Glu 
145 150 155 160 

Asp Glu Cys Ala Leu Met Leu Leu Glu His Gly Thr Asp Pro Asn He 
165 170 175 

Pro Asp Glu Tyr Gly Asn Thr Thr Leu His Tyr Ala Val Tyr Asn Glu 
180 185 190 

Asp Lys Leu Met Ala Lys Ala Leu Leu Leu Tyr Gly Ala Asp He Glu 
195 200 205 

Ser Lys Asn Lys His Gly Leu Thr Pro Leu Leu Leu Gly He His Glu 
210 215 220 

Gin Lys Gin Gin Val Val Lys Phe Leu He Lys Lys Lys Ala Asn Leu 
225 230 235 240 

Asn Ala Leu Asp Arg Tyr Gly Arg Thr Ala Leu He Leu Ala Val Cys 
245 250 255 

Cys Gly Ser Ala Ser He Val Ser Pro Leu Leu Glu Gin Asn Val Asp 
260 265 270 

Val Ser Ser Gin Asp Leu Glu Arg Arg Pro Glu Ser Met Leu Phe Leu 
275 280 285 

Val He He Met 
290 



<210> 346 
<211> 292 
<212> PRT 
<213> Homo sapiens 
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<400> 346 

Met His Leu Ser Phe Pro Ala Phe Leu Pro Pro Trp Met Asp Arg Gly 
5 10 15 

Ser Gly Lys Ser Asn Val Gly Thr Ser Gly Asp His Asn Asp Ser Ser 
20 25 30 

Val Lys Thr Leu Gly Ser Lys Arg Cys Lys Trp Cys Cys His Cys Phe 
35 40 45 

Pro Cys Cys Arg Gly Ser Gly Lys Ser Asn Val Gly Ala Trp Gly Asp 
50 55 60 

Tyr Asp Asp Ser Ala Phe Met Asp Pro Arg Tyr His Val His Gly Glu 
65 70 75 80 

Asp Leu Asp Lys Leu His Arg Ala Ala Trp Trp Gly Lys Val Pro Arg 
85 90 95 

Lys Asp Leu lie Val Met Leu Arg Asp Thr Asp Val Asn Lys Arg Asp 
100 105 110 

Lys Gin Lys Arg Thr Ala Leu His Leu Ala Ser Ala Asn Gly Asn Ser 
115 120 125 

Glu Val Val Lys Leu Val Leu Asp Arg Arg Cys Gin Leu Asn Val Leu 
130 135 140 

Asp Asn Lys Lys Arg Thr Ala Leu Thr Lys Ala Val Gin Cys Gin Glu 
145 150 155 160 

Asp Glu Cys Ala Leu Met Leu Leu Glu His Gly Thr Asp Pro Asn lie 
165 170 175 

Pro Asp Glu Tyr Gly Asn Thr Thr Leu His Tyr Ala Val Tyr Asn Glu 
180 185 190 

Asp Lys Leu Met Ala Lys Ala Leu Leu Leu Tyr Gly Ala Asp lie Glu 
195 200 205 

Ser Lys Asn Lys His Gly Leu Thr Pro Leu Leu Leu Gly He His Glu 
210 215 220 

Gin Lys Gin Gin Val Val Lys Phe Leu He Lys Lys Lys Ala Asn Leu 
225 230 235 240 

Asn Ala Leu Asp Arg Tyr Gly Arg Thr Ala Leu He Leu Ala Val Cys 
245 250 255 

Cys Gly Ser Ala Ser He Val Ser Pro Leu Leu Glu Gin Asn Val Asp 
260 265 270 

Val Ser Ser Gin Asp Leu Glu Arg Arg Pro Glu Ser Met Leu Phe Leu 
275 280 285 

Val He He Met 
290 
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<210> 347 
<211> 311 
<212> PRT 

<213> Homo sapiens 
<400> 347 

Met Val Val Glu Val Asp Ser Met Pro Ala Ala Ser Ser Val Lys Lys 
5 10 15 

Pro Phe Gly Leu Arg Ser Lys Met Gly Lys Trp Cys Cys Phe Pro Cys 
20 25 30 

Cys Arg Gly Ser Gly Lys Ser Asn Val Gly Thr Ser Gly Asp His Asn 
35 40 45 

Asp Ser Ser Val Lys Thr Leu Gly Ser Lys Arg Cys Lys Trp Cys Cys 
50 55 60 

His Cys Phe Pro Cys Cys Arg Gly Ser Gly Lys Ser Asn Val Gly Ala 
65 70 75 80 

Trp Gly Asp Tyr Asp Asp Ser Ala Phe Met Asp Pro Arg Tyr His Val 
85 90 95 

His Gly Glu Asp Leu Asp Lys Leu His Arg Ala Ala Trp Trp Gly Lys 
100 105 110 

Val Pro Arg Lys Asp Leu lie Val Met Leu Arg Asp Thr Asp Val Asn 
115 120 125 

Lys Arg Asp Lys Gin Lys Arg Thr Ala Leu His Leu Ala Ser Ala Asn 
130 135 140 

Gly Asn Ser Glu Val Val Lys Leu Val Leu Asp Arg Arg Cys Gin Leu 
145 150 155 160 

Asn Val Leu Asp Asn Lys Lys Arg Thr Ala Leu Thr Lys Ala Val Gin 
165 170 175 

Cys Gin Glu Asp Glu Cys Ala Leu Met Leu Leu Glu His Gly Thr Asp 
180 185 190 

Pro Asn lie Pro Asp Glu Tyr Gly Asn Thr Thr Leu His Tyr Ala Val 
195 200 205 

Tyr Asn Glu Asp Lys Leu Met Ala Lys Ala Leu Leu Leu Tyr Gly Ala 
210 215 220 

Asp lie Glu Ser Lys Asn Lys His Gly Leu Thr Pro Leu Leu Leu Gly 
225 230 235 240 

lie His Glu Gin Lys Gin Gin Val Val Lys Phe Leu lie Lys Lys Lys 
245 250 255 

Ala Asn Leu Asn Ala Leu Asp Arg Tyr Gly Arg Thr Ala Leu lie Leu 
260 265 270 
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Ala Val Cys Cys Gly Ser Ala Ser lie Val Ser Pro Leu Leu Glu Gin 
275 280 285 

Asn Val Asp Val Ser Ser Gin Asp Leu Glu Arg Arg Pro Glu Ser Met 
290 295 300 

Leu Phe Leu Val He He Met 
305 310 



<210> 348 
<211> 313 
<212> PRT 

<213> Homo sapiens 
<400> 348 

Met Val Val Glu Val Asp Ser Met Pro Ala Ala Ser Ser Val Lys Lys 
5 10 15 

Pro Phe Gly Leu Arg Ser Lys Met Gly Lys Trp Cys Cys His Cys Phe 
20 25 30 

Pro Cys Cys Arg Gly Ser Gly Lys Ser Asn Val Gly Thr Ser Gly Asp 
35 40 45 

His Asn Asp Ser Ser Val Lys Thr Leu Gly Ser Lys Arg Cys Lys Trp 
50 55 60 

Cys Cys His Cys Phe Pro Cys Cys Arg Gly Ser Gly Lys Ser Asn Val 
65 70 75 80 

Val Ala Trp Gly Asp Tyr Asp Asp Ser Ala Phe Met Asp Pro Arg Tyr 
85 90 95 

His Val His Gly Glu Asp Leu Asp Lys Leu His Arg Ala Ala Trp Trp 
100 105 110 

Gly Lys Val Pro Arg Lys Asp Leu He Val Met Leu Arg Asp Thr Asp 
115 120 125 

Val Asn Lys Arg Asp Lys Gin Lys Arg Thr Ala Leu His Leu Ala Ser 
130 135 140 

Ala Asn Gly Asn Ser Glu Val Val Lys Leu Val Leu Asp Arg Arg Cys 
145 150 155 160 

Gin Leu Asn Val Leu Asp Asn Lys Lys Arg Thr Ala Leu Thr Lys Ala 
165 170 175 

Val Gin Cys Gin Glu Asp Glu Cys Ala Leu Met Leu Leu Glu His Gly 
180 185 190 

Thr Asp Pro Asn He Pro Asp Glu Tyr Gly Asn Thr Thr Leu His Tyr 
195 200 205 

Ala Val Tyr Asn Glu Asp Lys Leu Met Ala Lys Ala Leu Leu Leu Tyr 
210 215 220 
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Gly Ala Asp lie Glu Ser Lys Asn Lys His Gly Leu Thr Pro Leu Leu 
225 230 235 240 

Leu Gly lie His Glu Gin Lys Gin Gin Val Val Lys Phe Leu He Lys 
245 250 255 

Lys Lys Ala Asn Leu Asn Ala Leu Asp Arg Tyr Gly Arg Thr Ala Leu 
260 265 270 

lie Leu Ala Val Cys Cys Gly Ser Ala Ser He Val Ser Pro Leu Leu 
275 280 285 

Glu Gin Asn Val Asp Val Ser Ser Gin Asp Leu Glu Arg Arg Pro Glu 
290 295 300 

Ser Met Leu Phe Leu Val He He Met 
305 310 



<210> 349 

<211> 30 

<212> PRT 

<213> Homo sapiens 

<400> 349 

Val Asn Lys Lys Asp Lys Gin Lys Arg Thr Ala Leu His Leu Ala Ser 

15 10 15 

Ala Asn Gly Asn Ser Glu Val Val Lys Leu Leu Leu Asp Arg 
20 25 30 



<210> 350 

<211> 30 

<212> PRT 

<213> Homo sapiens 

<400> 350 

Ala Leu His Leu Ala Ser Ala Asn Gly Asn Ser Glu Val Val Lys Leu 

15 10 15 

Leu Leu Asp Arg Arg Cys Gin Leu Asn Val Leu Asp Asn Lys 
20 25 30 



<210> 351 

<211> 25 

<212> PRT 

<213> Homo sapiens 

<400> 351 

Gly Ser Ala Ser He Val Ser Leu Leu Leu Glu Gin Asn He Asp Val 

15 10 15 

Ser Ser Gin Asp Leu Ser Gly Gin Thr 
20 25 



<210> 352 
<211> 20 
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<212> PRT 

<213> Homo sapiens 

<400> 352 

Val Val Glu Val Asp Ser Met Pro Ala Ala Ser Ser Val Lys Lys Pro 

15 10 15 

Phe Gly Leu Arg 
20 



<210> 353 

<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 353 

Ser Met Pro Ala Ala Ser Ser Val Lys Lys Pro Phe Gly Leu Arg Ser 

1 5 10 15 

Lys Met Gly Lys 
20 



<210> 354 

<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 354 

Ser Ser Val Lys Lys Pro Phe Gly Leu Arg Ser Lys Met Gly Lys Trp 

1 5 10 15 

Cys Cys Arg Cys 
20 



<210> 355 

<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 355 

Pro Phe Gly Leu Arg Ser Lys Met Gly Lys Trp Cys Cys Arg Cys Phe 

15 10 15 

Pro Cys Cys Arg 
20 



<210> 356 

<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 356 

Ser Lys Met Gly Lys Trp Cys Cys Arg Cys Phe Pro Cys Cys Arg Glu 

15 10 15 

Ser Gly Lys Ser 
20 



WO03013431 mttp://Wvw.qetthepatent^^ 



WO 03/013431 PCT/US02/24917 

126 



<210> 357 

<211> 20 

<212> PRT 

<213> Homo sapiens 



<400> 357 



Trp Cys Cys Arg Cys Phe Pro Cys Cys Arg Glu Ser Gly Lys Ser Asn 
15 10 15 



Val Gly Thr Ser 
20 



<210> 358 

<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 358 

Phe Pro Cys Cys Arg Glu Ser Gly Lys Ser Asn Val Gly Thr Ser Gly 

15 10 15 

Asp His Asp Asp 
20 



<210> 359 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 359 

Glu Ser Gly Lys Ser Asn Val Gly Thr Ser Gly Asp His Asp Asp Ser 

15 10 15 

Ala Met Lys Thr 
20 



<210> 360 

<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 360 

Asn Val Gly Thr Ser Gly Asp His Asp Asp Ser Ala Met Lys Thr Leu 

15 10 15 

Arg Ser Lys Met 
20 



<210> 361 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 361 

Gly Asp His Asp Asp Ser Ala Met Lys Thr Leu Arg Ser Lys Met Gly 

15 10 15 

Lys Trp Cys Arg 
20 
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<210> 362 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 362 

Ser Ala Met Lys Thr Leu Arg Ser Lys Met Gly Lys Trp Cys Arg His 

15 10 15 

Cys Phe Pro Cys 
20 



<210> 363 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 363 

Leu Arg Ser Lys Met Gly Lys Trp Cys Arg His Cys Phe Pro Cys Cys 

15 10 15 

Arg Gly Ser Gly 
20 



<210> 364 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 364 

Gly Lys Trp Cys Arg His Cys Phe Pro Cys Cys Arg Gly Ser Gly Lys 

1 5 10 15 

Ser Asn Val Gly 
20 



<210> 365 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 365 

His Cys Phe Pro Cys Cys Arg Gly Ser Gly Lys Ser Asn Val Gly Ala 

15 10 15 

Ser Gly Asp His 
20 



<210> 366 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 366 

Cys Arg Gly Ser Gly Lys Ser Asn Val Gly Ala Ser Gly Asp His Asp 
1 5 10 15 
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Asp Ser Ala Met 
20 



<210> 367 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 367 

Lys Ser Asn Val Gly Ala Ser Gly Asp His Asp Asp Ser Ala Met Lys 

15 10 15 

Thr Leu Arg Asn 
20 



<210> 368 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 368 

Ala Ser Gly Asp His Asp Asp Ser Ala Met Lys Thr Leu Arg Asn Lys 

15 10 15 

Met Gly Lys Trp 
20 



<210> 369 

<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 369 

Asp Asp Ser Ala Met Lys Thr Leu Arg Asn Lys Met Gly Lys Trp Cys 

15 10 15 

Cys His Cys Phe 
20 



<210> 370 

<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 370 

Lys Thr Leu Arg Asn Lys Met Gly Lys Trp Cys Cys His Cys Phe Pro 

15 10 15 

Cys Cys Arg Gly 
20 



<210> 371 

<211> 20 

<212> PRT 

<213> Homo sapiens 



<400> 371 



WO0301 3431 rhttp://www.qetthe paten t.corri/Loqin.doq/Sp atents4 /@21 01 2 1 %2E41 9 C1 2 / Fet c h/WO0 301 3431 .c pc? fr om C ache= 1 part^ roaintoo lbar=l:Page 297 of 308 



WO 03/013431 PCT/US02/24917 

129 



Lys Met Gly Lys Trp Cys Cys His Cys Phe Pro Cys Cys Arg Gly Ser 

15 10 15 

Gly Lys Ser Lys 
20 



<210> 372 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 372 

Cys Cys His Cys Phe Pro Cys Cys Arg Gly Ser Gly Lys Ser Lys Val 

15 10 15 

Gly Ala Trp Gly 
20 



<210> 373 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 373 

Pro Cys Cys Arg Gly Ser Gly Lys Ser Lys Val Gly Ala Trp Gly Asp 

15 10 15 

Tyr Asp Asp Ser 
20 



<210> 374 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 374 

Ser Gly Lys Ser Lys Val Gly Ala Trp Gly Asp Tyr Asp Asp Ser Ala 

15 10 15 

Phe Met Glu Pro 
20 



<210> 375 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 375 

Val Gly Ala Trp Gly Asp Tyr Asp Asp Ser Ala Phe Met Glu Pro Arg 

15 10 15 

Tyr His Val Arg 
20 



<210> 376 
<211> 20 
<212> PRT 

<213> Homo sapiens 
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<400> 376 

Asp Tyr Asp Asp Ser Ala Phe Met Glu Pro Arg Tyr His Val Arg Gly 

15 10 15 

Glu Asp Leu Asp 
20 



<210> 377 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 377 

Ala Phe Met Glu Pro Arg Tyr His Val Arg Gly Glu Asp Leu Asp Lys 

15 10 15 

Leu His Arg Ala 
20 



<210> 378 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 378 

Arg Tyr His Val Arg Gly Glu Asp Leu Asp Lys Leu His Arg Ala Ala 

15 10 15 

Trp Trp Gly Lys 
20 



<210> 379 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 379 

Gly Glu Asp Leu Asp Lys Leu His Arg Ala Ala Trp Trp Gly Lys Val 

15 10 15 

Pro Arg Lys Asp 
20 



<210> 380 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 380 

Lys Leu His Arg Ala Ala Trp Trp Gly Lys Val Pro Arg Lys Asp Leu 

15 10 15 

He Val Met Leu 
20 



<210> 381 
<211> 20 
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<212> PRT 

<213> Homo sapiens 
<400> 381 

Ala Trp Trp Gly Lys Val Pro Arg Lys Asp Leu lie Val Met Leu Arg 

15 10 15 

Asp Thr Asp Val 
20 



<210> 382 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 382 

Val Pro Arg Lys Asp Leu lie Val Met Leu Arg Asp Thr Asp Val Asn 

15 10 15 

Lys Lys Asp Lys 
20 



<210> 383 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 383 

Leu lie Val Met Leu Arg Asp Thr Asp Val Asn Lys Lys Asp Lys Gin 

15 10 15 

Lys Arg Thr Ala 
20 



<210> 384 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 384 

Arg Asp Thr Asp Val Asn Lys Lys Asp Lys Gin Lys Arg Thr Ala Leu 

15 10 15 

His Leu Ala Ser 
20 



<210> 385 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 385 

Asn Lys Lys Asp Lys Gin Lys Arg Thr Ala Leu His Leu Ala Ser Ala 

15 10 15 

Asn Gly Asn Ser 
20 
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<210> 386 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 386 

Gin Lys Arg Thr Ala Leu His Leu Ala Ser Ala Asn Gly Asn Ser Glu 

15 10 15 

Val Val Lys Leu 
20 



<210> 387 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 387 

Leu His Leu Ala Ser Ala Asn Gly Asn Ser Glu Val Val Lys Leu Leu 

1 5 10 15 

Leu Asp Arg Arg 
20 



<210> 388 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 388 

Ala Asn Gly Asn Ser Glu Val Val Lys Leu Leu Leu Asp Arg Arg Cys 

1 5 10 15 

Gin Leu Asn Val 
20 



<210> 389 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 389 

Glu Val Val Lys Leu Leu Leu Asp Arg Arg Cys Gin Leu Asn Val Leu 

1 5 10 15 

Asp Asn Lys Lys 
20 



<210> 390 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 390 

Leu Leu Asp Arg Arg Cys Gin Leu Asn Val Leu Asp Asn Lys Lys Arg 

1 5 10 15 

Thr Ala Leu lie 
20 
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<210> 391 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 391 

Cys Gin Leu Asn Val Leu Asp Asn Lys Lys Arg Thr Ala Leu lie Lys 

15 10 15 

Ala Val Gin Cys 
20 



<210> 392 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 392 

Leu Asp Asn Lys Lys Arg Thr Ala Leu lie Lys Ala Val Gin Cys Gin 

15 10 15 

Glu Asp Glu Cys 
20 



<210> 393 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 393 

Arg Thr Ala Leu lie Lys Ala Val Gin Cys Gin Glu Asp Glu Cys Ala 

15 10 15 

Leu Met Leu Leu 
20 



<210> 394 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 394 

Lys Ala Val Gin Cys Gin Glu Asp Glu Cys Ala Leu Met Leu Leu Glu 

15 10 15 

His Gly Thr Asp 
20 



<210> 395 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 395 

Gin Glu Asp Glu Cys Ala Leu Met Leu Leu Glu His Gly Thr Asp Pro 
15 10 15 
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Asa lie Pro Asp 
20 



<210> 396 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 396 

Ala Leu Met Leu Leu Glu His Gly Thr Asp Pro Asn lie Pro Asp Glu 

15 10 15 

Tyr Gly Asn Thr 
20 



<210> 397 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 397 

Glu His Gly Thr Asp Pro Asn lie Pro Asp Glu Tyr Gly Asn Thr Thr 

15 10 15 

Leu His Tyr Ala 
20 



<210> 398 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 398 

Pro Asn lie Pro Asp Glu Tyr Gly Asn Thr Thr Leu His Tyr Ala lie 

15 10 15 

Tyr Asn Glu Asp 
20 



<210> 399 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 399 

Glu Tyr Gly Asn Thr Thr Leu His Tyr Ala lie Tyr Asn Glu Asp Lys 

15 10 15 

Leu Met Ala Lys 
20 



<210> 400 
<211> 20 
<212> PRT 

<213> Homo sapiens 



<400> 400 
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Thr Leu His Tyr Ala lie Tyr Asn Glu Asp Lys Leu Met Ala Lys Ala 

15 10 15 

Leu Leu Leu Tyr 
20 



<210> 401 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 401 

lie Tyr Asn Glu Asp Lys Leu Met Ala Lys Ala Leu Leu Leu Tyr Gly 

15 10 15 

Ala Asp lie Glu 
20 



<210> 402 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 402 

Lys Leu Met Ala Lys Ala Leu Leu Leu Tyr Gly Ala Asp lie Glu Ser 

15 10 15 

Lys Asn Lys His 
20 



<210> 403 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 403 

Ala Leu Leu Leu Tyr Gly Ala Asp lie Glu Ser Lys Asn Lys His Gly 

15 10 15 

Leu Thr Pro Leu 
20 



<210> 404 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 404 

Gly Ala Asp lie Glu Ser Lys Asn Lys His Gly Leu Thr Pro Leu Leu 

15 10 15 

Leu Gly Val His 
20 



<210> 405 
<211> 20 
<212> PRT 

<213> Homo sapiens 
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<400> 405 

Ser Lys Asn Lys His Gly Leu Thr Pro Leu Leu Leu Gly Val His Glu 

15 10 15 

Gin Lys Gin Gin 
20 



<210> 406 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 406 

Gly Leu Thr Pro Leu Leu Leu Gly Val His Glu Gin Lys Gin Gin Val 

15 10 15 

Val Lys Phe Leu 
20 



<210> 407 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 407 

Leu Leu Gly Val His Glu Gin Lys Gin Gin Val Val Lys Phe Leu He 

15 10 15 

Lys Lys Lys Ala 
20 



<210> 408 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 408 

Glu Gin Lys Gin Gin Val Val Lys Phe Leu He Lys Lys Lys Ala Asn 

15 10 15 

Leu Asn Ala Leu 
20 



<210> 409 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 409 

Val Val Lys Phe Leu He Lys Lys Lys Ala Asn Leu Asn Ala Leu Asp 

15 10 15 

Arg Tyr Gly Arg 
20 



<210> 410 
<211> 20 
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<212> PRT 

<213> Homo sapiens 
<400> 410 

lie Lys Lys Lys Ala Asn Leu Asn Ala Leu Asp Arg Tyr Gly Thr Arg 

15 10 15 

Ala Leu lie Leu 
20 



<210> 411 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 411 

Asn Leu Asn Ala Leu Asp Arg Tyr Gly Arg Thr Ala Leu lie Leu Ala 

15 10 15 

Val Cys Cys Gly 
20 



<210> 412 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 412 

Asp Arg Tyr Gly Arg Thr Ala Leu lie Leu Ala Val Cys Cys Gly Ser 

15 10 15 

Ala Ser lie Val 
20 



<210> 413 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 413 

Thr Ala Leu lie Leu Ala Val Cys Cys Gly Ser Ala Ser lie Val Ser 

15 10 15 

Leu Leu Leu Glu 
20 



<210> 414 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 414 

Ala Val Cys Cys Gly Ser Ala Ser He Val Ser Leu Leu Leu Glu Gin 

15 10 15 

Asn He Asp Val 
20 



W O03013431 f http://www.qetthepatent.^ 



WO 03/013431 PCTYUS02/24917 

138 



<210> 415 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 415 

Ser Ala Ser He Val Ser Leu Leu Leu Glu Gin Asn He Asp Val Ser 

1 5 10 15 

Ser Gin Asp Leu 
20 



<210> 416 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 416 

Ser Leu Leu Leu Glu Gin Asn He Asp Val Ser Ser Gin Asp Leu Ser 

15 10 15 

Gly Gin Thr Ala 
20 



<210> 417 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 417 

Gin Asn He Asp Val Ser Ser Gin Asp Leu Ser Gly Gin Thr Ala Arg 

15 10 15 

Glu Tyr Ala Val 
20 



<210> 418 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 418 

Ser Ser Gin Asp Leu Ser Gly Gin Thr Ala Arg Glu Tyr Ala Val Ser 

15 10 15 

Ser His His His 
20 



<210> 419 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 419 

Ser Gly Gin Thr Ala Arg Glu Tyr Ala Val Ser Ser His His His Val 

15 10 15 

He Cys Gin Leu 
20 
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<210> 420 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 420 

Arg Glu Tyr Ala Val Ser Ser His His His Val lie Cys Gin Leu Leu 

15 10 15 

Ser Asp Tyr Lys 
20 



<210> 421 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 421 

Ser Ser His His His Val lie Cys Gin Leu Leu Ser Asp Tyr Lys Glu 

15 10 15 

Lys Gin Met Leu 
20 



<210> 422 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 422 

Val lie Cys Gin Leu Leu Ser Asp Tyr Lys Glu Lys Gin Met Leu Lys 

15 10 15 

lie Ser Ser Glu 
20 



<210> 423 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 423 

Leu Ser Asp Tyr Lys Glu Lys Gin Met Leu Lys lie Ser Ser Glu Asn 

15 10 15 

Ser Asn Pro Glu 
20 



<210> 424 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 424 

Glu Lys Gin Met Leu Lys lie Ser Ser Glu Asn Ser Asn Pro Glu Asn 
15 10 15 
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Val Ser Arg Thr 
20 



<210> 425 

<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 425 

Met Leu Lys lie Ser Ser Glu Asn Ser Asn Pro Glu Asn Val Ser Arg 

15 10 15 

Thr Arg Asn Lys 
20 

<210> 426 

<211> 33 

<212> PRT 

<213> Homo sapiens 

<400> 426 

Ser Lys Met Gly Lys Trp Cys Cys Arg Cys Phe Pro Cys Cys Arg Glu 

15 10 15 

Ser Gly Lys Ser Asn Val Gly Thr Ser Gly Asp His Asp Asp Ser Ala 
20 25 30 

Met 



<210> 427 

<211> 33 

<212> PRT 

<213> Homo sapiens 

<400> 427 

Ser Lys Met Gly Lys Trp Cys Arg His Cys Phe Pro Cys Cys Arg Gly 

15 10 15 

Ser Gly Lys Ser Asn Val Gly Ala Ser Gly Asp His Asp Asp Ser Ala 
20 25 30 

Met 



<210> 428 

<211> 33 

<212> PRT 

<213> Homo sapiens 

<400> 428 

Asn Lys Met Gly Lys Trp Cys Cys His Cys Phe Pro Cys Cys Arg Gly 

15 10 15 

Ser Gly Lys Ser Lys Val Gly Ala Trp Gly Asp Tyr Asp Asp Ser Ala 
20 25 30 

Phe 



